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PREFACE 



The present volume contains a complete translation, made 
in consequence of a suggestion by my eminent friend, Professor 
E. T. Whittaker, F.R.S., of the Italian text of my Lezioni di 
cakolo differenziale assoluto. 1 Two new chapters have been added, 
which are intended to exhibit the fundamental principles of 
Einstein's General Theory of Relativity (including, of course, 
as a limiting case, the so-called Special or Restricted Theory) 
as an application of the Absolute Calculus. 

I have already had occasion to remark in the Preface to the 
Italian edition that we possess various systematic and well- 
written expositions of Relativity by celebrated authors. The 
short treatment which is offered in the two new chapters of the 
present work presents some distinctive features which it may 
be well to point out explicitly. 

In the first place, in order not to increase the size of the book 
unduly, I have thought it expedient to confine myself to tracing 
the relativistic evolution of Mechanics (properly so called) and 
of Geometrical Optics, and to developing its most important 
consequences. In this treatment the whole of Electromagnetism 
is sacrificed. The sacrifice is certainly regrettable, since Electro- 
magnetism was historically related in the most intimate way to 
Einstein's conception, having served indeed as the support and 
model for Restricted Relativity. Furthermore, Electromagnetism, 
in common with every other physical phenomenon, now comes 
within the ambit of General Relativity. Much as the omission 
of Electromagnetism is to be regretted, it has the advantage 
of reducing the programme to subjects belonging to the pure 
Newtonian tradition (or to its developments); and it allows us 
to take a clearer and more exact view of the transition from the 
classical scheme of Mechanics to the relativistic one. 

1 Compiled by Dr, Enrico Persico (Rome, Stock, 1925). 
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For this reason I have followed the method — which I have 
adopted sometimes in lectures or articles on special subjects — 
of taking the classical laws as the starting point and then of 
trying to find inductively what modifications — negligible in 
ordinary circumstances — should be introduced in order to take 
account of Einstein's ideas; and in the first place, naturally, 
to take account of his Principle of Relativity, that is to say, the 
invariant behaviour of these laws under all .transformations of 
space and time, an auxiliary four-dimensional ds 2 being duly 
employed. This method has seemed to me to be preferable to 
the procedure of enunciating the postulates of relativistic 
Mechanics in abstract tensorial form, which is so comprehensive 
in physical content as to be almost inaccessible to ordinary 
intuition, except with ample comment and illustration. 

A further characteristic of our exposition is that we make 
extensive use not only of geometrical representation but also 
of the differential properties pertaining to the space-time con- 
tinuum; attention is drawn also to the special importance of 
the Einsteinian statics, the treatment being rigorous in some 
cases, while in others which involve fields variable with the time 
it is approximate. 

In closing this introduction to Chapters XI and XII I would 
add that they were prepared, still in collaboration with Professor 
Persico, at the suggestion of Mr. F. F. P. Bisacre, M.A. 

In connexion with the whole of the English edition, I must 
warmly thank the translator, Miss Marjorie Long, formerly 
Scholar of Girton College, who with double competence, scientific 
and linguistic, has known how to combine scrupulous respect 
for the text with its effective adaptation to the spirit of the 
English language. 

I owe hearty thanks also to Dr. John Dougall, who, while 
revising the proofs, has checked the analysis throughout, detected 
some oversights, and made many useful suggestions for improve- 
ment. I wish finally to thank my English publishers, who have 
not only acceded to, but almost always anticipated, my wishes 
in regard to symbols and the typography of the book. 



T. LEVI-CIVITA. 
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Riemann's general metric and a formula of Christoffel con- 
stitute the premises of the absolute differential calculus. Its 
development as a systematic branch of mathematics was a later 
process, the credit for which is due to Ricci, who during the ten 
years 1887-1896 elaborated the theory and worked out the 
elegant and comprehensive notation which enables it to be easily 
adapted to a wide variety of questions of analysis, geometry, 
and physics, 

Ricci himself, in an article published in Volume XVI of the 
Bulletin des Sciences Math&natiques (1892), gave a first account 
of his methods, and applied them to some problems in differential 
geometry and mathematical physics. Later on other interesting 
applications, made by himself or his students (to which group 
I had the privilege of belonging), suggested the desirability of 
preparing a general account of the whole subject, including 
methods, results, and a bibliography. This was the origin of the 
memoir " M6thodes de calcul dilf&entiel absolu et leurs appli- 
cations which was compiled by Professor Ricci and myself in 
collaboration, on the courteous invitation of Klein, and appeared 
in Volume 54 of Math. Ann. (1901), 

There is a chapter on the foundations of the absolute calculus, 
with special reference to the transformation of the equations of 
dynamics, in Wright's Tract, Invariants of Quadratic Differential 
Forms (Cambridge University Press, 1908); apart from this, 
while special researches based on the use of this method were 
continued after 1901 by a limited number of mathematicians, 
yet general attention was not again directed to it until the great 
renaissance of natural philosophy, due to Einstein, which found 
in the absolute differential calculus the necessary instrument 

vtt 



Hi PREFACE TO THE FIRST (ITALIAN) EDITION 



for formulating the new ideas mathematically and for the sub- 
sequent numerical work, 

Einstein's discovery of the gravitational equations was an- 
nounced by him in the famous note " Zur allgemeinen Relati- 
vitatstheorie " 1 in the following words: " Sie bedeutet einen 
wahren Triumph der durch Gauss, Riemann, Christoffel, Ricci . . . 
begrtadeten Methoden des allgemeinen Differentialkalculus."" 

In an earlier memoir Einstein had given a new exposition of 
those elements and formulae of the absolute calculus which more 
specifically served his purposes. A similar standpoint was sub- 
sequently adopted by the most distinguished workers in the field 
of general relativity, in particular by Weyl, 2 Laue, 3 Eddington, 4 
and BirkhofT, 5 all of whom made conspicuous original contribu- 
tions, both of idea and of method, to the physical theories, in 
addition to useful and elegant developments of the tensor calculus. 
Similar statements can be made for Carmichael, 6 Marcolongo, 7 
Kopflf, 8 Becquerel 9 - — to mention, from the vast literature on the 
subject, only the books 1 have myself had occasion to consult 
— while de Donder 10 has avoided the notation of the .absolute 
calculus and used instead the theory of integral invariants. 

Iti recent years there have been some general treatises devoted 
to the absolute calculus; for instance, those of Juvet, 11 Marais, 12 
and Galbrun. 13 Lastly, there is another calculus, in a new order 
of ideas, not less comprehensive and perhaps even more general, 
invented by Schouten, and developed with the collaboration 
of Struik. 14 

In face of this plentiful and valuable literature a new dis- 
cussion of Kicci's methods might seem to be superfluous; and 
conceptually this is perhaps true. 

In fact, of the improvements and additions to the scheme 
of 1901 (the memoir in Math. Attn.), derived mainly from the 
notion of parallelism 16 and on this basis introduced by me into 
two courses of lectures given at the University of Rome during 
the sessions 1920-1921 and 1922-1923, all, or almost all, will 
be found as independent discoveries of the authors already cited, 
in one or other of their books. 

For instance, the definition of a tensor, and some algebraic 
anticipations of the results intended to simplify the proofs, are 
to be found in Weyl, Laue, and Marais, all of whom, like Edding- 
ton, establish a more or less intimate connexion between co- 
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variant differentiation and parallelism. A thorough discussion 
of the latter is also given by Juvet and Galbrun. But the associa- 
tion with the algebraico-tensorial notation and with the elements 
of differential geometry is always less detailed and systematic 
than what I tried to establish in my lectures. The line of argu- 
ment followed in them has a particular unity, which may perhaps 
justify their appearance in print at this juncture. 

The manuscript was edited with great care and * intelligence 
by Dr. Enrico Persico, from notes of the lectures. 1 wish to express 
my thanks to him for his valuable help, and to my publisher, 
Signor Stock (who also attended the lectures), to whose continued 
encouragement the existence of the book is due. 

TULLIO LEVI-CIVITA. 

Rome, December. 1023. v 
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PART I 
Introductory Theories 

CHAPTER I . 
Functional Determinants and Matrices 
1. Geometrical terminology. 

In analytical geometry it frequently happens that compli- 
cated algebraic relationships represent simple geometrical pro- 
perties. In some of these cases, while the algebraic relationships 
are not easily expressed in words, the use of geometrical language, 
on the contrary, makes it possible to express the equivalent 
geometrical relationships clearly, concisely, and intuitively. 
Further, geometrical relationships are often easier to discover 
than are the corresponding analytical properties, so that geo- 
metrical terminology offers not only an illuminating means of 
exposition, but also a powerful instrument of research. We 
can therefore anticipate that in various questions of analysis it 
will be advantageous to adopt terms taken over from geometry. 

For this purpose it is essential to adopt the fundamental 
convention of using the term point of an abstract n~dimensional 
manifold {n being any positive integer whatever) to denote a 
set of n values assigned to any n variables x v x 2 , . . . x n . This 
is an obvious extension of the use of the term in the one-to-one 
correspondence which can be established between pairs or trip- 
lets of co-ordinates and the points of a plane or space, for the 
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cases n = 2 and n = 3 respectively. For the case of n vari- 
ables we can thus also speak of a field of points (rather than of 
values assigned to the x's), and of the region round a specified 
point Xi (i — 1, 2, . . . n). 

If the a?'s are w functions x t (t) of a real variable t, then when 
£ varies continuously between t Q and t x we get a simply infinite 
succession of points, the aggregate of which (as for n — 2 and 
n = 3) is called a Ztne, and more precisely an arc or segment of 
a line. 

2. Functional determinants and change of variables* 

Let there be n functions of n variables: 

x 2> • - • 

the functions and their derivatives to any required degree being 

supposed finite and continuous in the field considered. 

To simplify the notation, let x (without a suffix) represent 

not only (as is usual) any one of the n variables x x , x 2 , . . . x )V 

but also (as is sometimes done) the whole set of them; and 

similarly for other letters which will be used farther on. With 

this convention the given functions can be written in the abridged 

form: , v 

u,(x). 

With the usual notation, the functional determinant or Jacobian 
of the u*b is the determinant of the nth order whose terms are 
the first derivatives of the t^'s; i.e. 



D = 



du x 






dx x 


dx 2 


dx u 


du 2 


du 2 


du 2 


dx 1 . 


dx 2 




du» 


du n 




dx 1 


bx 2 





Such a determinant is sometimes represented by the abridged 

notation . , 

/u ± u 2 . . . u\ 

\x x x 2 . . . x n / 
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analogous to that used for fractions and substitutions, the set 
of functions u representing the numerator and the set of vari- 
ables x the denominator of a fraction. The analogy of form 
is justified by the analogy of properties, as can be seen by con- 
sidering the effect on a functional determinant of a change of 
variables. For let the x 9 a be functions of n variables y, 



(i) 



and suppose further that these equations represent a reversible 
transformation, i.e. that they also define the y 9 s as functions 
of the #\s, or, in other words, that they are soluble with respect 
to the #'s. If then the u 9 s are considered as functions of the 
y 9 s {being given in terms of the x% which are functions of the 
and the corresponding functional determinant 



Vi ■ • • yJ 



is formed, it will be found, as will be shown below in § 4, 
that D x ■= D multiplied by the determinant of the functions 
defined by equations (1), i.e. by 

A = " " * Xn \ 

\Vi • ■ ■ yJ' 

3. The fundamental theorem on implicit functions. 

Before proving the theorem just referred to, we must recall 
a fundamental theorem relating to implicit functions. It is known 
that a relation between two variables of the type 

f(y, x) = 0 

defines y as a function of x, provided certain suitable qualitative 
conditions are satisfied. 1 A classical form ; of the conditions 
sufficient for solubility is as follows. Let x°, be a point at which 
/ vanishes, / being finite in a (plane) region / round the point. 

Let exist in I and be not zero for x — x°, y = y°. Then 
dy 

1 When an equation is said to be "soluble ", this will not necessarily mean that 
the process of finding an algebraic solution can be carried out. 
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in a certain (linear) region round the value ofi the given equation 
defines a continuous function y(x) such that/(y(;r), x) vanishes 
identically. 

For implicit functions of several variables the following 
theorem, which is a generalization of the one just stated, holds. 

Let there be given n equations betweep. n variables y and 
any number of variables x of the form 



Let there be a set of values y°, which satisfy these equa- 
tions; in a region round the point x°, y°, let the /'s and their 
derivatives with respect to the yn be continuous, and let the 
determinant , - - , 



be not zero. Then the given equations define the y 9 s as functions 
of the x's in a region round the set of values x°. 

It will be seen that from a certain point of view the func- 
tional determinant of several functions of the same number of 
variables constitutes a natural generalization of the derivative 
of a function of one variable. This will follow explicitly from the 
applications of the following section. 

4. Effect on a functional determinant of a change of variables. 

Consider first the (sufficient) condition of solubility of the 
set of equations (1). Write the equations in the form 

- • • ?/J — a?* = 0 (i = 1, 2, . . . n), 

and suppose that there exists at least one set of values of the 
2/'s and the .r's which satisfy them and for which the functions 
x { (y) and their derivatives are continuous. Then, to apply the 
preceding theorem, we must calculate the partial derivatives of 
the left-hand side of each equation with respect to the j/s, and 
form their determinant. But these derivatives are the terms 
^ x 

%y ^ ' ~ 2 ' n ^ anC * * ience fc ^ e con ^^ on °* solubility with 
respect to the ys is 



f,(y\x) = 0 (i -= 1,2 



. n). 
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Now take the theorem stated in § 2, and suppose A 4= 0. 
Multiply together the two determinants D and A, i.e., inter- 
changing rows and columns in A, form the product 







3% 






0X2 




dx x 


dx 2 


dx n 




hi 


Vyi 


tyi 


du 2 


du 2 


3^ 




dx t 


dx^ 




dx x 




dx n 


X 




%y% 












dx ± 






dx x 


dx 2 


8x tl 








dy* 



Applying the ordinary rule, the product by rows gives as 
the typical element a rs of the resulting determinant the expression 

x'dxt dy H 3y, 

(remembering the rule for differentiating a function of one or 
more functions). Hence, as already stated, the product is the 
determinant D ± . This result is expressed by the formula 

\x Y . . . xj \y x . . . yj \y x . . . yj 9 " 

which justifies the use of this notation for the functional deter- 
minants. 



5. The necessary and sufficient conditions for the independence 
of n functions of n variables. 

If therefore the functional determinant of n functions of n 
variables does not vanish identically, it follows that this pro- 
perty still holds when the original variables are replaced by 
others related to the first set by the transformation (1) (with 
the condition A 4= 0); in other words, this is an invariant property. 
The following definition may therefore be given: 

Definition. — n functions ofn variables are said to be indepen- 
dent when their functional determinant does not vanish identically. 

The reason for applying the word " independence " to this 
property is shown by the following theorem. 
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Theorem. — Given n functions u of n variables x, the necessary 
and sufficient condition for the non-existence of any (differentiable) 
relation between them of the type 

f(u lf u 2 , . . . u u ) = 0 .... (3) 

involving only the u's and not the x's, is that their functional deter- 
minant does not vanish identically. 

We shall first show that the condition is sufficient; then that 
it is also necessary, but for the moment confining the proof to 
a particular case; the theorem in its general form will be shown 
farther on to be itself only a particular case of another still 
more general theorem (cf. § 7). 

Suppose the condition satisfied 

NO/] ... *^n' 

We shall then show that no relation of the type (3) can exist. 
(Identities are of course not considered; i.e. we exclude the case 
where equation (3) is satisfied when arbitrary values are assigned 
to the ?*'s, as it would not then represent any relation between 
the u's.) Suppose that such a relation does exist. Differentiating 
with respect to x v . . . x u , we should get n equations 

i du a dx i 

linear and homogeneous in the derivatives Now since by 

hypothesis / is a true function, not zero identically, these deri- 
vatives are not all zero. Hence the determinant of the coefficients 
of this group of equations vanishes; i.e. D = 0, which is con- 
trary to our hypothesis. The condition (4) is therefore sufficient 
to secure the non-existence of any relation of the type (3). 

To prove that condition (4) is necessary, we shall show that 
if it is not satisfied, i.e. if 

2> = 0, (5) 

then the w's are connected by a relation (at least one) of the 
type (3). For the moment the only case considered will be that 
in which at least one of the minors of order n — I oi the deter- 
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minant D does not vanish. This minor will in general be of the 
type 



\ X P X X P<1 • • ■ x p*-lf 



where 3^, . . . p n _ t and ft, 1 represent any two arrangements 

ol n — 1 integers chosen without repetitions from the numbers 
1, 2, .... w. But since the order in which the x's and u y s are 
made to correspond to the numbers 1,2,. . . n is immaterial, we 
can, without loss of generality, suppose numbers assigned to the 
variables in such a way that Z)' is the minor formed by the first 
n — 1 rows and n — 1 columns; we thus get 



D' — ( Ul ' U »A =# 0. 

. . . x n _ 1 ' 



(6) 



This condition expresses the fact that no relation exists 
between the first n — 1 functions. 

Now we know that if a reversible transformation is applied 
to the sr's, it follows from hypothesis (5) that the determinant 
of the u*s with respect to the new set of variables y is also zero. 
Let the relation between the as's and */'s be given by the following 
equations: 



'Un-l — u n-l( x H • • • x u)y 



(7) 



We may note thau these formulae define a reversible trans- 
formation, since the functional determinant of the y's with respect 
to the x's is 









dx x 




dx n 












dx n 


0 


0 


1 
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Theorem. — Given n functions u of n variables x, necessary 
and sufficient condition for the non-existence of any (differentiable) 
relation between tliem of the type 

/K, ^ 2 > • - - u n) = 0 .... (3) 

involving only the u\s and not the x's, is that their functional deter- 
minant does not vanish identically. 

We shall first show that the condition is sufficient; then that 
it is also necessary, but for the moment confining the proof to 
a particular case; the theorem in its general form will be shown 
farther on to be itself only a particular case of another still 
more general theorem (cf. § 7). 

Suppose the condition satisfied 



/u x . . . u„\ 
\X} . . . x n / 



+ 0 (4) 



We shall then show that no relation of the type (3) can exist. 
(Identities are of course not considered; i.e. we exclude the case 
where equation (3) is satisfied when arbitrary values are assigned 
to the u's, as it would not then represent any relation between 
the u's.) Suppose that such a relation does exist. Differentiating 
with respect to a^, . . . x n , we should get n equations 

v V 9*. = o (i =- 1,2, . . .n), 
i du a dx h 

linear and homogeneous in the derivatives Now since by 

hypothesis / is a true function, not zero identically, these deri- 
vatives are not all zero. Hence the determinant of the coefficients 
of this group of equations vanishes; i.e. D = 0, which is con- 
trary to our hypothesis. The condition (4) is therefore sufficient 
to secure the non-existence of any relation of the type (3). 

To prove that condition (4) is necessary, we shall show that 
if it is not satisfied, i.e. if 

I> = 0, (5) 

then the w's are connected by a relation (at least one) of the 
type ( 3 ). For the moment the only case considered will be that 
in which at least one of the minors of order n — 1 of the deter- 
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minant D does not vanish. This minor will in general be of the 
type 

where p l9 . . . p n _j and q l9 . . . q n _ x represent any two arrangements 
of n — 1 integers chosen without repetitions from the numbers 
1, 2, .... n. But since the order in which the sc's and u 9 & are 
made to correspond to the numbers 1, 2, ... n is immaterial, we 
can, without loss of generality, suppose numbers assigned to the 
variables in such a way that D' is the minor formed by the first 
n — 1 rows and n — 1 columns; we thus get 

D' = ( Ul ' ' ' U,l ~ 1 ) 4= 0. ... (6) 

This condition expresses the fact that no relation exists 
between the first n — 1 functions. 

Now we know that if a reversible transformation is applied 
to the x'h, it follows from hypothesis (5) that the determinant 
of the w's with respect to the new set of variables y is also zero. 
Let the relation between the x's and y'& be given by the following 
equations: 

Vi = M x i> • ■ • x »)> 



... 



We may note that these formulae define a reversible trans- 
formation, since the functional determinant of the y's with respect 
to the x's is 







3mj 


















dx n 


0 


0 


1 
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and expanding this from the last row, it is seen to be equal to 
D\ which by hypothesis is not zero. 

Now consider the w's as functions of the y's; using equations 
(7) we get 

| ... (8) 

u u = u n (y L , . . . jf.,-1, y tt )J 

Expressing the fact that the determinant of the u'& with 
respect to the is zero, we get 

1 0 0 0 

0 1 0 0 



0 0 ... 1 0 

It follows that the last of the equations (8) does not contain 
y n ; substituting in it from the remaining equations, it becomes 

' " w.Ki • • • «*, -i)> 

i.e. a relation between the w's which does not contain any of the 
x's. 

Hence from the hypotheses (6) and (5) it follows that there 
exists one relation of the type (3), which is such that u H can be 
expressed in terms of the other w's. This relation is unique, 
because if there were another, then eliminating u n between them 
we should get a relation between u x , . . . u„ 1 ; but this, as already 
pointed out, is incompatible with hypothesis (6). 

G. Functional matrices. Definition of the independence of 
m functions of n variables. 

We shall now examine the more general case in which the 
number m of the functions u is not equal to the number n of the 



= = o. 
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variables x. For this purpose we must consider the functional 
matrix of the given functions, i.e the following matrix of m 
rows and n columns: 



du x 

dx x dx% 



3 2^ dx 2 



dx n 



In what follows it will be denoted by M\ but it must be noted 
that no numerical value is attached to the symbol, and there- 
fore that M does not represent a quantity, but is an abbreviation 
for the arrangement of terms under consideration. 

The characteristic of a matrix is the order of the non-vanishing 
determinants of highest order which can be constructed from it; 
it can therefore obviously not be greater than the number of 
rows or the number of columns, whichever is the less. 

We now give a definition, which will be justified in the follow- 
ing section. 

Definition. —in functions of any number of variables are said 
to be independent when the characteristic of their functional matrix 
is m. It follows immediately that if the number of functions is 
greater than the number of variables, the functions cannot be 
independent; while if the two numbers are equal, the definition 
coincides w T ith that already given, since the matrix becomes a 
determinant of order m, and if its characteristic is m this is 
equivalent to saying that the determinant does not vanish. 



7. Theorem. 

Given m functions u of any number of variables x, if the 
characteristic of their functional matrix is k, then there are m — k 
relations (and not more) between the u's which do not involve 
the it's. 

It will follow immediately as a corollary that if the functions 
are independent (the case k = m) there exists no relation between 
them. 

The theorem just stated has been proved above (§ 5) for the 
particular cases in which the number of functions is equal to the 
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number of variables and in addition i=mori=w-l. 
We proceed to prove it in general, taking various cases in turn, 
as follows: 

(1) k = m (and m < n) 9 the case of independence; 

(2) k < m. | (26) t < ^ 



Case (1): & — m. This hypothesis is equivalent to saying 
that there exists a minor of order rn which is not zero; remember- 
ing the remark made on p. 7, we may suppose without loss of 
generality that 

M 4= 0. 



Applying the theorem of p. 6 it follows that the u 9 s are not 
connected by any relation which does not involve any of the 
sc's. 

Case (2a): k < m, k = n. There is therefore a minor of 
order n which is not zero. We may arrange the suffixes of the 
m 9 b and the x's so that the minor in question is that formed by 
the first n rows and n columns, and we shall have 



D = fa • • • M 4= 0. 

\ x l • • • x n/ 



We shall now show that u n+1 , u„+ 2 , . . . u m can all be expressed 
in terms of the remaining u'&, without using the sc's, so that we 
shall have m — n (which is the same as m — k) relations between 
the w's. For since D 4= 0 we may change the variables. Let 
the new variables be given by the equations 

™i = ^i^, • - • x n ), 

Un — Ud%l> • - . X n ). 

Solving these equations with respect to the x's, and substituting 
the expressions so obtained in u n . hl , ...u tn , these will be expressed 
as functions of u l9 . . . u n ; hence the theorem is true for this 
case. 
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Case (26): k < m, k < n. The hypothesis is that there exists 
a determinant of order k which is not zero, and that every deter- 
minant of higher order vanishes. Let us arrange the u'a and the 
x 9 s so that 

D= ^"-^Uo (9) 

. . . x k ! 

We shall show that any function u h (h = k + 1, • - - m) 
can be expressed in terms of the first k functions u, without 
involving any of the x's. For this purpose, consider the deter- 
minant 0 formed by bordering D with the (A; + ] )th column 
and Ath row of the matrix; since it is of order k + 1, it is 
zero by hypothesis, i.e. 

0 = /u, . . . u k u h \ Q (10) 
Xa^ . . . x k x k+1 / 

Now applying the theorem stated on p. 6, it follows from this 
equation and the inequality (9) that u h can be expressed as a 
function of u v . . . u Jn which does not involve x l9 . . . x k9 x k ^\ i.e. 
since we are not yet able to say anything about the remaining 

sc's, 

u h ...u k \ x k+2 , . . . a?J. . . (11) 

The next step is to show that x k + 2 , . . . x lt do not in fact occur 
in this expression. If n k + 1> there is no need to consider 
x k+2" - ' • x n> an d therefore the formula (11) represents the expres- 
sion we are in search of, giving u h in terms of 14j 5 mm* U k alone. 
If this is not so, let Xj denote any of the variables x k + 2y . . . x n9 
and consider the determinant 0', obtained from 0 by replacing 
x k+1 by x J9 so that 




0' vanishing because it is a minor of order k + 1 taken from the 
matrix. Expanding it, substituting from equation (11), and 
making certain transformations, we can easily show that it 

involves the vanishing of whence it follows that <f> does not 

~ dXj 

contain x. In fact, representing compactly by the letter D the 
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square matrix of those elements of 0 which form the determinant 
D, we have 

dx, 



0' = 



3% 
dx 1 



du k 
dxj 

dx k dxj 



Using equation (11) the elements of the last row are given by 

i du L ox, 



3% d<f> ^_ £ d<f> du L 
dxj dXj \ l dui dxj 



Multiplying the elements of each of the first h rows in turn 



by 



d<f> 



d<f> 



, and subtracting the sum of these products from 



the elements of the last row (which does not change the value 
of tin determinant) the last row becomes 



0 



0 



dxj 



and therefore, expanding from this row, we get 

0' dc f> 



dx, 



Since by hypothesis D 4= 0, it follows that 



d<f> = 
dxA 



0, which 



proves the assertion. 

The theorem enunciated at the beginning of this section is 
thus completely proved. Applying it to the particular case 
m = n, it coincides with the theorem of p. 6, which is therefore 
now shown to hold without any restriction. 
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CHAPTER II 
Systems of Total Differential Equations 
1. Preliminary remarks. 

The reader may first be reminded of some general considera- 
tions on differential expressions. 

Given a function f(x Xy x 2 , . . . x n ), the expression 

df == i* f to* 

i dx, 

is called the total differential of the function /; it is equal (except 
for infinitesimals of higher order) to the increment of/ in passing 
from the point x v x 2 , . . . x u to the infinitely near point x 1 + dx lt 
x 2 + dx 2 , . . . x n + dx ir 

Given n functions X, of the #'s 5 which, together with their 
first derivatives, we shall suppose finite and continuous, the 
expression 

<p = 2, X t (x v x 2 , . . . x n ) dxi . . . (1) 
i 

is called a differential, or Pfajffian, expression. 

An expression of this form is not always an exact differential; 
i.e. there does not always exist a function f(x l9 x 2 , . . . x n ) such 
that the given Pfaffian is its total differential. The necessary 
and sufficient condition for the existence of such an/, i.e. for the 
integrability of an equation of the type 

df i,X t dx, (2) 

1 

is that the following %n(n — 1 ) conditions should be satisfied: 

dX J= **i l,2,...n). . . (3) 

dx f cx t 

If these conditions are satisfied in a certain field, the integral 
calculus shows how to construct the most general function / 
which has the required property; i.e. it shows how to integrate 
the given differential expression. All the possible /'s differ from 
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one another by a constant. If we follow the procedure usual in 
elementary treatises, and consider not the whole field but a suit- 
ably restricted region round a point x arbitrarily fixed in advance, 
then in this region each of the /'s is a uniform function (i.e. one- 
valued, like all the functions we are considering) of the argu- 
ments x v x 2 , . . . x n . 

We now proceed to discuss a more general problem than this. 
Let there be m unknown functions u of n independent variables 
x, and let there be given a set of relations between their differ- 
entials which define the dus in terms of the dec's, in the form 

du a = 2, X a]i (x | u) dx £ (a = 1, 2, . . . m), . (4) 

i 

where the X 9 & are mn arbitrarily assigned functions (finite and 
continuous, together with their first derivatives). 

A group of relations of the type (4) is called a system of 
total differential equations 1 ; equation (2) is obviously only a 
particular case. It may be remarked that equation (2) is itself 
equivalent to the system of n equations 

l f - X t (x) (i = 1,2, . . . n), . . (2') 

V X l 

and that the equations (4) are analogously equivalent to the 
system of mn equations 




Both are problems of partial differential equations, and are 
soluble only under specific conditions; but if these are satisfied, 

1 In a system of this kind the group of variables to be considered independent 
is fixed in advance. The late Professor G. Ricci in a recent work has considered 
instead a system of / equations of the type 

2a rg (x)dx a =0 (r = 1, 2, ...J), 
i 

determining the conditions that the n variables x may be considered functions of 
any number p(<n) of independent variables, and indicating the stops necessary 
to find the solution (ef. Atti del Reale 1st. Ven., Vol. XXXI. 1922-1J, pp. 179-183). 

An account of the general theory of Pfattian systems, with recent develop- 
ments due mainly to von Weber, Curtan, and Goursat, is given in the last-named 
author's Lecon» sur le probUme de Pfaff (Paris, Hermann, 1922). 
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we shall see that the integration reduces to that of ordinary 
differential equations. 

2. Conditions necessary for integrability. Completely in- 
tegrate, or complete, systems. 

When the problem is stated in the form (4'), it is obvious 
(from the symmetry of the second derivatives of the u'a) that 
a necessary condition for the existence of solutions is that the 
following conditions shall be satisfied: 

= dX «U ( a = 2 . - • • m \ m ( 5 ) 
dx 3 dx t = 1, 2, . • . n /' 

The symbol denoting total differentiation has been used as 
a reminder that in differentiating it is necessary to take into 
account that the arguments u also depend on the as's, i.e. that 

dxj dXj i* dup dxj 

Using this result, the equations (5) take the form of \mn{n — 1) 
relations of the type 

F(x J u) = 0 (5') 

These, it will be seen, in general contain not only the x's but 
also the w's (unlike the equations (3)); and we must suppose the 
it's replaced by those unknown functions of x which satisfy the 
given system of equations. The conditions of integrability can- 
not therefore be given explicitly without knowing beforehand 
the solutions of the system. This difficulty did not arise for the 
equation (2), since the X's, and therefore their derivatives, did 
not contain the unknown function. 

But it may happen— and this is the most interesting case— 
that the equations (5) are not only satisfied for those particular 
u\ which form a solution of the system, but are true identically, 
i.e. for any set of values whatever of the u 9 s and of the £r's. In 
this case, as we shall see, these conditions are not only necessary, 
but also sufficient, for the integrability of the system, which is 
then said to be co7npletely integrable, or complete. 
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3. The integration of a mutually consistent system can 
always be reduced to that of a complete system. 

We shall now show that whenever a system of total differential 
equations is integrable (in the sense that there exists at least one 
set of m functions u a {x li a? 2 , . . . x n ) which satisfy the system), 
the integration reduces to that of a complete system; we shall 
thus be able to confine our subsequent discussions to systems of 
the latter kind. 

As we have already said, there are \mn(n — 1) conditions of 
integrability (5'), while there are m w's. Now for n > 2, 
m < %mn(n — 1). In general, therefore, there cannot be m func- 
tions u w T hich satisfy these conditions, and therefore the system 
can certainly not admit of solutions. If exceptionally these con- 
ditions are mutually consistent it may happen either that m of 
them are independent, so that there is then one single set of 
values for the i^s which satisfies these m conditions, and it only 
remains to test whether these u's also satisfy the given system 
of equations; or that they are all satisfied identically (and then 
the system is complete); or that—the most general case— they 
reduce to a number v < m of mutually consistent and indepen- 
dent equations. In the latter case, v of the unknowns can be found 
in finite terms, expressed in terms of the x 9 & and the remaining 
m — v — ^ unknowns. Arranging the u's in a suitable order,, 
we may suppose that the equations (5') give us the last v of thf 
functions u, viz. the functions 

in terms of the #'s and the remaining u% 

For greater clearness, we shall denote these first it functions 
u °y u 'a ( a 1 5 2, . . . /x), and the last v by u^' = u^ +fi 
(/? = 1,2,... v). Using this notation, the equations (5') can 
be put in the form resolved with respect to the namely 

< (jS = 1,2 u). . (5") 

Next, suppose the system of equations (4) divided into two 
groups; one consisting of the first /z: 
/ n 

du a = 2, X a]i (x | u) dx L (a « 1, 2, . . . /*); . (4a) 

1 CD«&5) 
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and the other of the remaining vz 

n 

du a = S, : J5C a( , (x \u)dXi (a = ft + 1, ft + 2, . \ . m — /x + 
1 

The latter group, putting a — ft + ft we shall write 
in the form: 

- | t * M+/3|t (a I u) d Xi 0=1,2,... „). . (46) 

Substituting from the equations (5") and (4a), the two sides 
of this last equation become linear expressions in the differentials 
dx l} with coefficients which depend solely on the sc's and the w's. 
Since the coefficients on both sides must be the same (the differ- 
entials dx t being independent), the equations (46) reduce to 
equations in finite terms, nv in number, between the u"& and 
the j;'s. 

If all these reduce to identities, we need only consider the 
system of equations (4a), in which the functions u" are to be con- 
sidered as replaced by their values as given by the equations 
(5"), so that we have a total differential system, of the same 
form as the original system (4), involving only the u"s, /jl in 
number, where ft m— v < m. The essential result in the 
case under consideration is that the system (4a) so reduced is 
necessarily complete. In fact, it consists of a part of the original 
system (4) with the additional relations (5") between the w's. 
The condition of intcgrability of the whole system (4) (where 
a priori the ?/'s were treated as so many unknowns) consisted of 
the equations (5), or, we may say, of the equivalent equations 
(5"). For the system of equations (4a) the analogous conditions 
will consist of a part of the conditions (5") (or combinations of 
these), with the proviso that every u" is to be replaced by the 
cot responding expression given by the equations (0") themselves. 
This process obviously leads to mere identities; hence, as stated, 
the system (4a) is complete. 

If on the other hand the equations (46) give rise to non- 
identical relations in finite terms between the u"s and the se's, 
we shall have to associate them with the equations (4a) and 
treat this whole system of equations in /x unknowns (including 
some total differential equations and some equations in finite 

( I> (565 i 2 
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terms) as we have already treated the system of equations (4) 
and the conditions (5). 

Proceeding in this way, we shall reach a stage where either 
the conditions are found to be mutually inconsistent, when we 
must conclude that the given system has no solution/ or else the 
problem reduces to the integration of a complete system (with a 
number of unknowns which is certainly less than m ). Q.E.D. 

In consequence we shall now confine our attention solely to 
complete systems. 

4. Bilinear covariants and the resulting form for the conditions 
of complete integrability. 

We have expressed the condition of complete integrability 
by means of the equations (5), which are supposed to hold for 
arbitrary values of the u's and of the sc's. We shall now express 
this condition in a more concise form. 

For this purpose take two different systems of infinitesimal 
increments of the x's, denoted by dx, and 8x, respectively; the 
corresponding increments of a generic function u of the x's will 
then be denoted by du and 8u respectively, and will be given 
by 

du = £ ; dx n 
i Cxi 

\ (7) 

8u = lt t 8x t . 

1 OXi J 

Now the dx J B are arbitrary infinitesimals, on which we can 
a priori impose any hypotheses we please; we shall consider 
them as infinitesimal functions of the afs. With this hypothesis 
the increments of these rfx's, corresponding to the increments 
8x of the variables, will naturally be denoted by 8dx\ with a 
similar interpretation for d8x. The increment du will also be an 
infinitesimal function of the x's, and we shall thus have to con- 
sider 8du; d8u will be similarly defined. We shall next obtain 
the explicit expression of these two second* differentials of u, in 
order to show that a slight restriction on the arbitrariness of 
the second differentials of the independent variables will be 
sufficient to ensure the result 8du = d8u, whatever the 
function u may be. 
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Applying the symbol of operation S to the first of the equations 
(7), we get (without any restrictive hypothesis) 



8du = 2 4 8 ( ^L) dxi + ± A ° u 8dx t 



du 
'dx. 



= S, S, * * dx t 8xj + S, I U 8dx t . . . (8) 
i i dx.dXj J i 

The expression for d8u will evidently be similar, with d and 
8 interchanged. Now the first part of the formula is unaltered 
by this interchange, while in the second 8dx t is replaced by 
d8x r If therefore we impose on the arbitrary functions dx and 
Sx of the sc's the condition 

dSx t = 8dx, (i = 1, 2, . . . n), . . (9) 

which represents a very small loss of generality, the second part 
of the formula (8) will also be unaltered when d and 8 are inter- 
changed; we shall therefore have, for any function whatever 

X%> * • • 

d8u = 8du (10) 

It may be noted incidentally that in the differential calculus 
it is usual to impose a hypothesis involving considerably greater 
restrictions than the conditions (9); the usual convention is 
that the second differentials of the independent variables are 
zero, or that the dx's are not functions of the x's, but constants. 

We shall now consider, along with the increments of the 
independent variables, not a function a with its differentials, 
but a generic Pfaffian 

i 

in which the X'a are given functions of the x's. 

The suffix d has been inserted as a reminder that the Pfaffian 
refers to the increments dx; y the same Pfaffian relative to the 
increments 8x L will be conveniently distinguished by the analogous 
notation 

n 

fa ^iX,8x r 
i 
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Both ift d and i/r 6 will naturally be functions of the x 9 &. Cal- 
culating &^ we thus get 

8£, = E, 8X, dx, + 2, X k Ux, - E, S, V?:' ^ 8^ + 2, X,. Mi, ; 



i r 



or with the abridged notation which can be used when several 
summations between the same limits are applied to the same 
general term, 

n dX- n 
i ox J i 

Interchanging and S we get d0 6 . Using the relation (9), 
the difference ?»fs d — dtfj s reduces to 

But the value of a sum is plainly unaffected by the parti- 
cular letters of the alphabet which we choose to assign to the 
suffixes with respect to which the summation is to be made. 
We may therefore interchange i and j in the second part of the 
preceding formula, so that we can now write the equation in the 
form 



#a S fJ - * X ') dxM r ■ (ID 
i \dxj dxj 



The expression &fi <t - dip s is called the bilinear covariant 
relative to the given Pfaffian. The use of the term " bilinear " 
is sufficiently justified by the expression just found, which is 
linear in the arguments dx and also in the arguments Sx. The 
name " covariant " is due to the circumstance that the numerical 
value and formal structure of the two sides of equation (11) 
always remain the same when the independent variables x vary 
in any way whatever. But we shall return to this point farther 
on (cf. Chapter VI, p. 144) in connexion with the general idea of 
invariants (functions or differential forms). 

Meanwhile it may be noted that if the Pfaffian ift d is an exact 
differential, i.e. if the conditions (3) are satisfied, the right-hand 
side of equation (11) becomes zero, and we reach a result which 
has already been found (cf. formula (10)). 
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We may now return to the examination of the system of 
equations (4), and the conditions of complete integrability. Con- 
sider the m Pfaffians which constitute the right-hand sides of 
the equations (4): 

and construct their bilinear covariants. We shall show that the 
two conditions: (a) that these covariants vanish identically, 
however dx and Sx are chosen; and (6) that the equations (5) 
are identically true whatever values are assigned to the u's, 
are completely equivalent, so that the condition of complete 
integrability may be written in the form 

8$;>- (hjjf = 0 (a 1, 2, . . . m\ . (12) 

it being understood that this equation must hold for arbitrary 
values of the increments dx and Sx. 1 

To prove this, take the explicit expression of these bilinear 
covariants, in the form given by equation (11). In differentiating 
it must be remembered that the X's must be considered as 
functions of the x's, both directly, and also indirectly as functions 
of the ti's. Using the convention already adopted, the derivatives 
can therefore be denoted by £he symbol for total differentiation; 
equation (12) thus becomes 

Now if the conditions (5) for complete integrability are satisfied, 
the coefficients of this bilinear form (i.e. the expressions in paren- 
theses in equation (12')) are all zero, and therefore the equation 
is satisfied however the dx's and S#'s are chosen. Vice versa, 
suppose that the equation is satisfied however the dx's and 8x's 
are chosen. Then all the coefficients must necessarily be zero. 
For if we take all the dx's and 8x's as zero, except one pair, 
e.g. dx„ Sx jy where t 9 j, are two arbitrarily chosen but definite 

1 As a matter of fact we have imposed the restrictions (9) on the second 
differentials 5</r lf d5x t , but the infinitesimal increments dx^ 8x t to be assigned to 
the a^'s at the generic point under consideration are still entirely arbitrary, 
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integers of the series 1, 2, . . . n; then the sum in equation (12') 
reduces to the single term 



/dX a}i _dX ai \ g 
\ dx J dxt ) * 39 



which cannot vanish unless 



dX a \i dX a 



dxj dx z 

We therefore conclude that the conditions (5) can be written in 
the more concise form (12). 

5. Morera's method of integration. 1 

We shall now show that the conditions of complete integrability 
are sufficient for integrability, or more precisely that if they are 
satisfied there exists one and only one set of m functions u(x) 
which satisfy the given system of liquations and have values 
arbitrarily fixed in advance at a point also fixed in advance. 
Considering these initial values of u as arbitrary constants (as 
evidently they may be considered to be), we can say more 
shortly that the general integral depends on m arbitrary constants, 
or that there are oo w integrals. 

For the proof, we first fix a generic point -P 0 ( x i> x", . . . x^), 
in the field of variation of the a?'s in which the X's are defined. 
Let P^x], ) be another arbitrary point in the field, and 

suppose it joined to P 0 by a line T which does not leave the 
field. T will be defined by parametric equations 

x t = £(0 (i = 1, 2, . . . n), . . (13) 

where t is a parameter which has the value t 0 at P 0 and the value 
t ± at P x . We shall provisionally confine our investigation to the 
points of this line, so that for the present any functions u of the 
x'a are to be considered as functions of the variable t alone (via 
the x'b and the equations (13) ). Their derivatives will be 

du n _ * du a dx t _ 

dt ~ Tdx £ dt (a ~ 1 ' 2 '"- m >» 

141 Zur Integration der vollstandigcn Differentiate in Math. Ann., Vol. 27, 
1886, pp. 403-411. Cf. also Skvkki: "Sul nietodo di Mayer per l'integrazione 
delle equazioni lineari ai differenziali totali", in Atti del JL 1st. Veneto, Vol. 
LXTX, 1910, pp. 419-426. 



TOTAL DIFFERENTIAL EQUATIONS 



Ml. 5 



or, denoting differentiation with respect to t by a dot, and sub- 
stituting from equation (4'), 

du n 

dt ~ ^^ a \ iXi > (1*) 
du 

OT dt = dt {a = h % ' ' ' m) ' ' (W '> 

The x?8 are known functions of I given by equations (13); 
hence the equations (14/) are of the type 

~ - U a (t I u ± , u 2 , . . . u m ) (a - 1, 2, . . . m), (14'') 

i.e. they form a system of ordinary differential equations, in the 
norma] form. Now given m arbitrary constants 
it is known from the calculus that — subject to qualitative eondi 
tions of continuity and existence of derivatives, which we suppose 
satisfied-— there exist m functions u a (t) which satisfy the system 
(14"), and which are equal to the given constants when t = it 0 . 
If, therefore, the tt's are given any arbitrary set of values at P 0 , 
they are defined at all points of the line Z\ and therefore also 
at P x . It may however happen— and does in general— that if 
the points P Q and P x are joined by another line instead of T, 
different values will be found for the u's at P x . But we shall 
now show that if the conditions of complete integrability are 
satisfied, the values of the u'& at P l9 found by the method just 
described, are independent of the line T, so that these tt's will 
be functions only of the co-ordinates of P l9 that is, functions of 
position; they will satisfy the given system of equations not 
only along a line, but along all the infinite number of lines which 
can be drawn in the given field, or, in other words, in the whole 
of this field. They will therefore constitute the required solutions 
of the total differential system (4), as we shall show later on. 

We shall simplify our task by considering infinitesimal dis- 
placements; i.e. by showing in the first place that the values of 
the u'& at P x remain unaltered if the line T undergoes an infinitesi- 
mal deformation; it will follow that they will be the same for 
any line which can be obtained from T by a succession of infini- 
tesimal deformations, i.e. by a continuous deformation of T. 
If then we suppose the field such that every line joining P Q and 
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P x can be obtained in this way, we shall have all that is required. 
Such fields (e.g. a triangle or a circle in a plane, a cube or a sphere 
in space) are called simply connected. 

Consider therefore a line T' infinitely close to T; we may 
think of it as obtained by displacing each point P of T, of co- 
ordinates x t , to a point P' of co-ordinates x, + hx n and the 
infinitesimal increment hx L may be taken in the form e^, for 
example, where every Xi is a finite quantity varying from point 
to point of the curve (and therefore a function of t), and e is 
an infinitesimal factor taken as constant, and therefore indepen- 
dent of U With these conditions the parametric equations of 
the curve T' will be 

Xt+Sxi - h(t)+c Xl {t). . . . (15) 

The functions Xi ma y be considered as arbitrary, except for 
the condition of vanishing for t — t 0 and for t = £ 1? so that the 
lines T and T' may have the same extremities. We shall adopt 
the natural convention of using the operator 8 to denote the incre- 
ment of a generic quantity (scalar or vector) in passing from the 
point P of T to the corresponding point P' of T'. 

Now suppose the equations (14") integrated along T'\ we 
shall get functions of t, u a + bu a9 satisfying the equations 



(« a + 8u a ) = > + (a = 1, 2, . . . m), 



dt dt ^ d 



using hypothesis (12), expressing the complete integrability of 
the system, we can also write the equations in the form 

. (") 

dt dt v ' 

From the theorem of the existence of integrals of ordinary 
differential systems (already referred to in connexion with 
equations (14') ), it follows that the quantities Su a are uniquely 
determined by these equations together with the condition of 
vanishing at P 0 . Now the equations (16) are obviously satisfied 
by taking 

K ^ 0i a) > (17) 
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(i.e. assuming for the quantities 8u a the expressions appropriate 
to the case where the w's are in fact functions of the #'s). These 
expressions vanish with the Stf,'s, i.e. at P 0 (so satisfying the 
initial conditions which, together with the equations (16), deter- 
mine them uniquely), and also at P 2 ; which proves the required 
result. 

It is thus proved that in order to construct the functions 
u whose total differentials are the assigned Pfaffians ^f ( ° } (satisfy- 
ing identically the equations (12) or the original equations (5) ) 
and which have given values u" at a given point P 0 , we need 
only join P 0 to any point P x by any line T, and integrate the 
system of ordinary differential equations along T. 

To complete the proof, we must now show that the differentials 
of the functions of the co-ordinates of P 1 obtained in this way 
are in fact the functions 0^ a) . Consider a point P 2 infinitesimally 
close to P Y \ to construct the values of the ?/'s at P 2 take the 
broken line made up of T and the small segment P X P 2 . It is 
then obvious that integrating the equations (14) along this line 
we get, in passing from P x to P 2 , the increment du A = 

6. Note on Mayer's method. 

The method followed in the preceding section to show the 
existence of the integrals of a complete system of total differential 
equations, is due to Morera. 

There was an earlier method, proposed by Mayer, by no 
means so clear, and seemingly dependent on a purely formal 
device. Morera's method, which is inspired by geometrical 
intuition, brings out the true reason for the success of Mayer's 
device, and provides a criterion for its validity. 

Mayer's method is to join the points P 0 and Pj by a segment 
of a straight line, instead of by any line T, so giving the equations 
(13) the form 

x, - a? + (*: - x { ])t (i -= 1,2,... n); 

the proof consists of a series of purely algebraic operations, 
instead of the proof developed above almost without calculations. 
In addition, while Morera's method can be applied if we merely 
suppose that the field in which the given equations hold is simply 
connected, Mayer's method, on the contrary, obviously requires 

l U 0.; j ) 2* 
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a much more restrictive hypothesis, namely, that any two points 
in the field can be joined by a straight line which lies wholly in 
the field. This property is expressed by saying that the field is 
convex. 

7. Application. 

Given a generic Pfaffian 

0 =-- H i X i (x)dx i9 
i 

we shall investigate whether it is possible to find a relation between 
x'b of the type 

f(x l9 x 2 , . . . x n ) = C (C constant), . (18) 
which shall be an integral of the equation 

i/r hiXidXi — 0, . . . . (19) 
i 

in the sense that the relation produced by differentiating equation 

(18) , namely, 

df ^ = 0, .... (18') 

] 

is equivalent to the equation (19). 

For this it is plainly both necessary and sufficient that the 
derivatives of the unknown function / should be proportional 
to the given functions X t . We therefore need some test to apply 
to the X/s themselves which will show whether they are pro 
portional to the derivatives of a single function not known in 
advance. 

This problem, which also occurs in geometrical questions (as 
we shall see in particular on pp. 263-265), reduces at once to a 
particular case of a total differential system. In fact, given that 
if/ does not vanish identically, and therefore has at least one of 
its coefficients not equal to zero, we may legitimately suppose 
that X n does not vanish identically. We can thus write equation 

(19) in the form 

dx n = - X^dx, (19') 
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In order that this may be equivalent to the equation (18'), 

we must have ~— 4= 0 in the latter* From this condition'it follows 

dx n ^ 

that the equation in finite terms (18) defines a function 

x H ~ u{x li x 2 , . . . x a _ l9 C), . . (18") 

which makes equation (18) an identity, and therefore also equation 
(18'), as well as the equivalent equations (19) and (19'). This 
last equation is evidently a particular case of systems of the 
type (4) consisting of one equation and one unknown function 
x H ; it must therefore be completely integrable, having as integral 
the function given by formula (18"), which depends on the arbi- 
trary constant C. Reciprocally, if (19') is completely integrable, 
then there will be a solution (18") depending on an arbitrary 
constant C; solving with respect to C, this becomes an integral 
relation of the desired form (18). The problem therefore reduces 
to expressing the completeness of equation (19'). 

Applying formula (5), the required conditions of completeness 

are 

L, = Lit fti=l,2....—l;<+i). 

Expanding the derivatives, these relations are easily put in 
the following form: 

A ' l W,- dxJ^Xdx,, dxjr^\dx L dxj U . (20). 
(i, j =- 1, 2, ... n —1; i 4= j). 

Introduce for the moment the restriction that all the other 
functions X, as well as X n9 are different from zero. We can then 
write 

whatever r and s may be, so that the conditions of integrability 
take the more concise form 

Pij + Pjn + Pm = 0 (i, j = 1, 2, . . . n - 1; i (22) 
The conditions (22) are \{n — 1) (n — 2) in number, this being 
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the number of ways of choosing two distinct integers, i, j, from 
the series 1, % ... n— 1. They represent all the conditions of 
integrability. Now the choice of the variable x n to be expressed 
as a function of the remaining x 9 & was arbitrary (subject only to 
the condition X n 4= 0); hence in general the relations 

Pu + P*k + Ph = 0 . . . . (22') 

must be satisfied, where i, j, k y are any three integers, no two of 
which are the same, chosen from the series 1, 2, ... n* Such 
a triplet can be chosen in $n(n — 1) (n — 2) ways; this is therefore 
the number of relations of the form (22'). But these are of course 
not all independent, since the conditions (22) (which form only 
part of (22') ) are sufficient for the complete integrability of the 
expression under consideration. In fact, it is easy to show directly 
that only \{n— l)(n — 2) of the equations (22'), e.g. those given 
by formula (22), are essential, the others reducing to algebraic 
deductions from them. 

This can be shown by moans of the following lemma, which 
holds whatever the terms p lk may be. If j> t/r (i> Jc, = 1, 2, ... n) 
is a double skew (or antisymmetrical) system, 1 and if for some 
fixed suffix a the cyclic relation 

Pa + Pka + Pai = 0 

is true for every pair of suffixes i, Jc, then this relation is also 
true for any three suffixes i, /<-, L 

To prove this, take the corresponding relations for the pairs 
Jc 9 I, and I, i, 

Pai + Pu + Pate " 0, 

Pli + Pia + Pa.1 = 0. 

Adding, and remembering the condition of skewness 

Pie* + Pak = 0, & C -> 

there remains 

Pik + Ph + Pu = 0. Q.E.D. 
Substituting in equations (22') the values of the jo's given by 

' I.e. a system of numbers such that a one-to-one correspondence, by a given 
law, exists* between them and the pairs of integers i t jt( = l, 2, . . , n), and such 
that pu = - pu* for atiy pair of indices whatever. 
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formula (21), and multiplying by XfX f X k so as to clear of 
fractions, we get the equations of condition 

/dXj_dXA /9X fr _SZA /dX J _dX\_A 
l \dx k dx s r >\dx t dx k r h \dx J dxj~ (23) 
k ^ 1, 2, . . . n). 

We thus find this whole set of equations as a necessary con- 
sequence of that group of them- -say, the group (20)— in which 
one of the suffixes is fixed, with the further condition that none 
of the JC's vanish. This last condition was applied at the point 
where we divided by the product of the -XV, it is, however, not 
essential, and can ultimately be discarded, as we shall now show. 
In fact, the equations (23) being necessary consequences of the 
equations (20) for any non-zero values of the Jf's, however 
small, and being integral in the X's and their derivatives, it 
follows that we may pass to the limit when any one of the X'a 
tends to zero. We therefore have, for all values of the Jf's, that 
the equations (23)— or a group of them of the type (20)— con- 
stitute the necessary and sufficient conditions for the complete 
integrability of the equations (19), or, in other words, the condi- 
tion that the n functions X t (x lt x 2 , . . . x n ) may be proportional 
to the derivatives of a single function. 

8. Mixed systems of equations. 

In certain problems we have to deal with mixed systems, i.e. 
those containing some total differential equations and some 
equations in finite terms: 

du x - S ( I a!l ^- (a - 1, 2, . . . m), . (4) 
1 

F k (x I u) - 0 (It -1,2,... v). . . (24) 

The discussion is essentially the same as in § 3 (p. 16). But 
we propose to go through it again in order to obtain, in a form 
suitable for use in concrete cases, the condition of complete 
integrability of a mixed system of the type (4), (24). 

It is obvious in the first place that a necessary condition for 
the existence of solutions is that the equations (24) (which we 
shall suppose mutually consistent and independent) are not more 
in number than m, the number of the unknowns u. If there were 
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exactly m of them, they would completely determine the w's, 
and we should only have to examine whether the w's satisfy 
the equations (4). We shall therefore suppose 

v < m, 

and shall imagine the equations (24) solved with respect to v 
of the u&, which will thus be expressed in terms of the #'s and 
the remaining m — v = /x unknowns u. 

As on p. 16, we shall call the two groups of u 9 s respectively 
u p (j8~ l f 2, . . . v) and u' a (a = 1, 2, . . . /a), so that the equations 
(24) may be written (cf. equations (5") ) in the form 

< =/s(*|«0 09= 1,2, ... v). . . (24') 

Corresponding to this division of the w's into two groups it 
will be convenient to divide the equations (4) into two groups 
(4a) and (46) (as was done on p. 16), which we repeat here for the 
reader's convenience: 

du' a = l, l X all dx i (a = 1, 2, . . ./x), . . (4a) 
i 

<?«;' - itX^tx | u)rfx; 08 = 1, 2, . . . v). (46) 

1 

We now propose to show that the given mixed system is 
completely integrable — and it will be called complete — if the 
following conditions are satisfied: 

(a) The conditions (5) for the complete integrability of the 
equations (4) are satisfied when after differentiation the values 
of u" given by equations (24') are substituted in them; they 
need not in general hold when any arbitrary functions are taken 
for the w'"s; 

(6) When the functions u" are replaced by their values as 
given by equations (24'), the equations (46) must be identical 
with the equations obtained by differentiating the equations 
(24'); or more concisely, the equations (46) must reduce to iden- 
tities on substituting from equations (24'). 

We shall show that if the mixed system is complete, in the 
sense now considered, then the equations (4a), when the u" 9 & 
in them are expressed in terms of the u's and the ac's by means 
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of equations (24'), constitute a completely integrable system of 
fx total differential equations in /x unknowns; the u"a can there- 
fore be obtained from them, and hence, by equations (24'), the 
w'"s; by hypothesis (6) above, the equations (46) will thus be 
satisfied. Hence the problem will be solved and its general integral 
(p. 22) will contain p—m — v arbitrary constants. 
To simplify the formulas, we shall agree that if 

<D (x I u', u") 

is any function whatever of the ac's and the it's, then 

[<&](* K) 

will denote the same function when the w"'s are replaced by the 
expressions (24'). We shall obviously have 



ow„ L3m v J i ldu a 1 ou v 



1, 2, . 



»), (25) 
/*)• (26) 



With this convention, we can write hypotheses (a) and (b) 
respectively in the forms 



{i, j — 1, 2, ... n) 



(a = 1, 2, 



(27) 



(28) 



We have therefore to examine the conditions of complete 
integrability of equations (4a), which will be 

d S*«_lA = d :^j3 (a = 1, 2, . . . jll), (29) 
and we have to show that they are satisfied identically. 
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Let 116 transform the left-hand side of (29) by first writing out 

in full the result of applying the operator - to a function of 
the x 9 & and the u"s* We shall get j 




or, using formulae (25) and (26), 
and finally, using (28), 

[^"'] + !'[ s if']^'J + ^K; !, ] [2 ' + " J - 

Remembering that the m arguments u consist of the two 
groups ?/ and ?/", it will at once be seen that this is merely the 
left-hand side of (27). Interchanging i and j 9 the right-hand side 
of (29) similarly is seen to be identical with the right-hand side 
of (27); equations (27) being supposed to hold, it follows that the 
equations (29) are satisfied identically. 

It follows that the integration of a complete mixed system of 
the type (4), (24), reduces to that of a complete (and therefore inte- 
qrahle) total differential system in jjl unknowns. The general integral 
therefore contains /x ----- m — v arbitrary constants. 

If the mixed system is not complete, i.e. if the conditions 
(a) and (b) are not satisfied without further restrictions, then 
discussion on the lines of § 3 (p. lfi) obviously shows that we must 
add to the equations (24) so many of the conditions (a) and (b) 
as do not reduce to identities in virtue of equations (24), since 
the equations (12) must hold whenever a set of m integrals u a 
exists. Repeating the same procedure, we reach either an incon- 
sistency, showing that equations (4), (24), can have no solutions. 
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or else a complete system with less than /x unknowns. In the 
latter case the number of constants in the general integral is also 
less than fx. 

A particular result of the foregoing discussion is that if v 
independent equations in finite terms are associated with a system 
of total differential equations in m functions u, the differential 
system being itself complete, then in the most favourable case 
(i.e. when the combined system is also complete) the number of 
constants in the integral is lowered by v units, from m to m — v. 

In general (i.e. when the mixed system is not complete) the 
integrals, if they exist, certainly contain less than m — v con- 
stants. 



CHAPTER III 

Linear Partial Differential Equations 
Complete Systems 

1. Linear operators. 

In this chapter we shall frequently use N to denote the number 
of independent variables, which will themselves be denoted by 
the letters z l9 . . . z x . 

hetf(z l9 . . . z iS ) be any function whatever, subject only to the 
condition of being difTerentiable to any required order. The 
term linear operator relative to/ will be used to denote the opera- 
tion by means of which an expression of the type 

1 vz t . 

is obtained from/, the a/s being any functions whatever of the 
z's. An expression of this kind will sometimes be denoted by a 
formula of the type Af, in which it is hardly necessary to point 
out that A is not a quantity, but the symbol of operation just 
defined. 

We have therefore * g 

A ~ S„ a — . 
1 oz v 



It can at once be verified that the linear operator symbol 
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behaves in exactly the same way as the differentiation symbol 
when / is a sum, a product, or a composite function (a function 
of one or more functions); i.e. for two generic functions f x , 
we have identically 

^(/i+/ 2 ) = Af x + AU .... (1) 

^(/i/ 2 ) = /i4/i+/i4/i. ... (2) 

with obvious extensions to any number of terms. Further, if 
/ is given as a function of n arguments v l7 v 2i . . . v w which are 
themselves functions of z, we obviously have 

4M v 2 , . . . v a ) - l f - Av x f a/ ^ 2 + . . . + V (3) 

Now consider the result of applying successively the two linear 
operators 

A V 9 

i 9^ 

the 6's, like the a's, denoting functions of z which are differentiate 
to any required order. 

The second-order operators 

A(Bf), B(Af) 

are thus completely defined; they may be written without danger 
of ambiguity in the form 

ABf, BAf. 

Writing out the first of these in full, we get by successive 
stages 

i oz„ i \ p dz„ \ p dz p / 

= S„ p a v -f J. + £ a„ b 

i * dz„ dz i p p dz y dz P 
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Similarly, interchanging A and B, and therefore a and 6, 
BAf = S /iBa^^b I* 

It appears from this that the two operators ABf and BAf 
are not equal; the second-order terms are however the same, 
as will be seen on interchanging the indices v and p in one of the 
two double sums. It follows that the difference of the two opera- 
tors in question is a linear operator of the first order; it is called 
the alternate function or Poisson's parenthesis relative to the two 
operators A and B, and is denoted by the symbol of operation 
( A, B) 9 so that 

(A, B)f = ABf — BAf = i v {AK ~ Ba v ) d l. . (4) 

1 0Z V 

It follows from the definition of the symbol that 

(A, B)f - - (B, A)f. .... (5) 

We shall now establish a formal property of linear operators, 
which we shall use farther on. 
Let there be n linear operators 

A J - - S„aj, |1 -- / (k = I, 2, . . . n), 
1 oz v 

and let any two linear combinations of these (which will also be 

linear operators), n 

Bf= ^ k X k A k f 
1 

n 

Of = 2 ht u h A h f 9 

be constructed, the A's and the /x's being any differentiate func- 
tions whatever of the independent variables z. 

We propose to show that the alternate function (B, C)f is 
a linear combination of the operators A and of their alternate 
functions. For the proof, it is sufficient to write out (B 9 C)f 
in full; this gives 

(B,C)f = BCf- CBf = i k \ k A k (Cf)-£ h to,A„(Bf) 

1 1 

)1 

= IK AfahAhf) — A (K A /)]• 
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Applying the rule for the differentiation of a product, the last 
expression becomes 

2*a[(M*MaMa/-- {jx h A h X k )A k f] + X kh X k f* h [A k A h f-> A h A k f\ 
i i 

so that finally 

(B, C)f = MX)A k f+ X kt i h (A k9 A,,)/]. 

Q.ED. 

2. Integrals of an ordinary differential system and the partial 
differential equation which determines them. 

Consider a system of n ordinary differential equations of the 
first order, in n unknowns x r Denoting the independent variable 
by t, and supposing the equations solved for the derivatives of 
the unknown functions, we get the equations in what is called 
the normal form: 

d £ - X t (x\t) (i = 1,2, . . . n). . (6) 

Any set of n functions x t (t) which satisfies the given equations 
is called a solution of the system. 

The term integral of the system, on the other hand, is used 
to denote any function f(x\t) which reduces to a constant when 
the aj's are replaced by any solution of the equations (0). We 
can therefore t&y that / is an integral if the result 

f(x | /) constant 

is a necessary consequence; of the differential equations (6). 

We shall now show that all the functions / with this property 
(and no other functions) satisfy a homogeneous linear partial 
differential equation of the first order; it follows, as we shall 
see farther on, that the integration of an equation of this form 
can always be reduced to that of a system of the type (0). 

Let f(x 1 1) be an integral of the equations (0); then by definition, 
when the x's represent functions of t which satisfy equations (6), 
we have 

f[x 1 1) = constant, 
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and therefore, differentiating with respect to t, 

dt 1 dt 
or, since the functions x^l) satisfy equations (6), 

dt + 7dx X ' ~ °* • • 



(7) 



This is the partial differential equation referred to. Introducing 
for shortness the linear operator 

A = + 3 (7') 

we can write it concisely in the form 

Af*= 0. 

Now by hypothesis equation (7), like the equation / = con- 
stant, from which it is derived by differentiation, becomes an 
identity when the x\ in it are replaced by any solutions whatever 
of the system (G); from this it is easy to deduce that (7) is an 
identity, that is, that it holds for any values whatever (in a 
suitable field) which may be assigned to the arguments x, t> 
of which / is a function. In fact, given n -f- 1 numbers sr 1 /, . , . sc", 
t 0 , belonging to a field within which the general existence theorem 
holds for the system (6), we know that there always exists a 
solution x, of the system (6), which takes the values . . . a£ 
when t = l 0 . Now equation (7) must hold (whatever t may be) 
when this particular solution x £ {t) is substituted for the #'s. 
In particular, putting t £ 0 , the equation is satisfied for the 
values x", t 0 , arbitrarily chosen in advance. Q.E.D. 

It is further evident that any function f(x | t) which satisfies 
equation (7), when the rr's and the fs in it are treated as indepen- 
dent variables, constitutes an integral of the system (6). In 
fact, since equation (7) holds however the x's are chosen, it will 
be satisfied in particular when we take a solution of the system 

(6) for the x's; but when this is done the left-hand side of equation 

(7) becomes identical with ^ The function / is therefore such 
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that when the x's are solutions of the system (6), — = 0, or 
/ = constant. 

To sum up, we can state that the necessary and sufficient 
condition that a Junction f(x | t) may be an integral of the system 
(6) is that it should satisfy the partial differential equation (7), in 
which the x's* and t are n + 1 independent variables. 



3. Principal integrals. 

Among the integrals / of the system (6) (which, as we have 
seen in the preceding section, can also be called integrals of 
equation (7)), there are, for each value t 0 of t, n of special impor- 
tance which we now proceed to specify. 

We take as our starting-point the most general solution of 
the equations (6), which is known to be a set of n functions of t, 
containing n arbitrary constants x", . . . x {) n : 

x, - 4> t (t | x°) (» = 1, 2, . . . n). . . . (8) 

The constants x° are the values of the x's for a given value 
< 0 oHsothat {<f>)f tB==x o (9) 

We shall show first that the equations (8) are soluble with 

respect to the #°s in a region round the point t 0 . Write them in 

the form . , ox n 

</>,(t I x°) — x t = 0, 

and consider the functional determinant of the left-hand side 
with respect to the x°'&, which is 

d — i^ 1 ~ Xi ^ ~~ Xz ' ' ■ ^ — ^y 

\ x x x 2 . . . x? tl } 

or, since the x 0, s are contained only in the <£'s, and not in the 
x's, 



T) _ (4*1 $2 ■ ■ ■ <£/A 

" U? xl ... a*J m 



Now calculate the value Z> 0 of this determinant for t = t 0 . 
Since the determinant itself contains no derivatives with respect 
to U we shall obtain the same result if we differentiate the functions 
<f>(t | afi) with respect to the rr°\s, form their determinant, and 
finally make t = £ 0 , as if we first make t = t 0 in the ^'s, and 
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then form the determinant of their derivatives. Following the 
second alternative and remembering the formulae (9) we see at 
once that the determinant becomes 



Now if Z), which is a continuous function of t 9 does not vanish 
for t = t 0 , it will have these same properties in some region 
round t 0 , and therefore within this region the equations (8) will 
be soluble with respect to the x°*s. 

Solving the equations, we shall get 



and the w's on the right-hand side constitute n integrals of the 
system (6). In fact, if we replace the x's in them by any solution 
of the system (6), (i.e. by a set of n functions obtained from the 
equations (8) by assigning particular arbitrary values to the 
constants ar°), then each w necessarily becomes equal to the 
corresponding x°, i.e. to a constant. 

The integrals of equation (7) obtained in this way are called 
principal integrals relative to t — £ 0 . From the definition it follows 
that 



Writing x instead of we see that a characteristic property 
of the principal integrals w t relative to t = t 0 is that each of the 
functions w L (x \ t) reduces to the corresponding variable x when 



Without undertaking a detailed study of the n principal 
integrals, we may at least show that none of them can be expressed 
as a function of the others only; i.e. that considered as n functions 
of the n + 1 variables x and t, they are independent. For this 
it is necessary and sufficient that the functional matrix (with 
n rows and n + 1 columns) of the w's with respect to the x's 
and t shall have n for its characteristic; i.e. that the matrix shall 
contain a determinant of order n which is not zero. Now if we 
take the determinant 




x<; = w t (x\t) (i = 1, % 



'9 • • • 



n), . 



(10) 




(11) 
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and apply to it the same considerations as we have already used 
for A we find that it = 1 for t — £ 0 (since then w L = x £ ) and 
therefore there is a region round t 0 in which it is not asero; hence 
the characteristic of the* matrix is n and the principal integrals 
are independent. 



4. Independent integrals. General integral. 

More generally, n integrals v v v 2 , . . . v n of equation (7) are 
said to be independent if the functions v € (x | t) (i = 1, 2, . . . n) 
are independent. Of course every function 

v 2 , . . . v„) (12) 

of the v's only is also an integral, as follows immediately from 
formula (3), remembering that for every v„ with the operator 
A as defined by (7'), we have 

Av L = 0. 

But the reciprocal theorem is also true, arid every integral 
of equation (7) can be put in the form (12), which therefore repre- 
sents the general integral of equation (7). 

To prove this, let / be an integral of equation (7); then the 
n + 1 equations 

d t i d x } 



+ = 0 (i= 1,2, 

at i ox. J 



<7» 



linear and homogeneous in the n + 1 quantities 1, X l9 . . . X n9 
which do not all vanish, will be satisfied. The determinant of 
their coefficients must therefore vanish, i.e. 



\t x x . . . xj 



This means that/, v,, v 2 , . . . v n are not independent. As the 
v's are independent, one of the determinants of order n of the 
functional matrix relative to v l9 v 2 , . . . v n is certainly not zero. 
But this is the case considered in Chapter I, pp. 5-8; hence we 
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conclude that /can be expressed in terms of the v's only, without 
involving t or the x 9 s. 

5. Direct study of the most general linear homogeneous partial 
differential equation, 

As a consequence of the relations which we have shown to 
hold between linear homogeneous partial differential equations 
of the first order, and systems of ordinary differential equations 
of the first order, we can in every case reduce the integration of 
an equation of the former kind to the integration of a system of 
the latter. In fact, equation (7) differs from the most general 
possible equation only in having one of its coefficients equal to 
1; but it will be at once obvious that every linear homogeneous 
equation of the first order can be reduced to this form-— an 
elementary remark which we shall examine in more detail. 

Consider the equation 

Af-~ i,a* f = 0 .... (13) 
1 oz v 

in N independent variables z x , . . . z N . 

At least one of the a's, say a N , will be different from zero. 
We may therefore divide the equation by a A . As a result of this 
step, z y is in what we may call a privileged position (the co- 
efficient of being reduced to unity); it is therefore natural 
cz s 

to denote it by a special symbol. Calling it t, and introducing the 
symmetrical notation (n = N — 1) for the remain- 

ing variables z, equation (13) becomes 

V = 0 . 

ot 1 a N ox t 
which will coincide term for term with equation (7) if we put 
£' ^ X i (i = 1, 2, . . . n). 

The corresponding system of equations (6) will thus be 

**i „ * (i _ 1, 2, . . . »). . . . (14) 
at a,y 
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Integrating these n equations, in which t is considered as the 
independent variable, we get the functions x t (t\ as 0 ), which reduce 
to x { - when t = t 0 ; solving for each of the £c°'s, the resulting 
expressions 

w % {x\t) = Wi(z) 

will be the n principal integrals of equation (13) relative to t = t 0 , 
and we may take 

F(w v . . . w n ) 

as the form of the general integral, where the symbol F denotes 
an arbitrary function. 

It will frequently be quicker not to find the principal integrals 
tv, but to obtain n independent integrals v l9 v 2 , . . . v n of the equa- 
tions (14) in any way whatever. The general integral is then at 
once expressible as an arbitrary function of the v's only, in the 
form 

Ffa, v 2 , . . . t>J. 

We have seen that we may choose as independent variable 
(for the system of equations (14) ) any one of the variables z 

which contributes an actual term ^ ^to equation (13) (the corre- 
sponding coefficient a not being zero). The choice may be deter- 
mined by reasons of convenience for the particular case concerned. 
In order to avoid prejudging the case before the necessary reasons 
for our choice appear, we may write the equations (14) in the 
form 

dx, dt n _ 

- = (% - 1, 2, . . . w), 



or, returning to the original notation, 
dz x _ dz 2 dz A 



(15) 



It will be seen that these can be at once written down from 
the given partial differential equation; it should be noted that 
if any of the a's is zero, the differential of the corresponding 
variable must also be equated to zero. 
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Examples: 

(1) Take the equation in three variables 

J1+ y\f + *f--0 (16) 

dx dy dz 

The corresponding system of two ordinary differential equa- 
tions is 

dx dy dz 
x y z 9 

i.e. d logx = d logy = d logz. 

Writing these in the form 

d logy — d logx — 0, 
d logz — d logx = 0, 



which is the same as 



rflog^ =: 0, 

x 



d log- = 0, 
x 



we get 



U = c x , - = c 2 (c v c 2 constants). 
x x 



Hence two independent integrals are 

x x 

and therefore the general integral will be 

f(1 z ). 

\x X/ 

This is merely the most general homogeneous function of 
degree zero. In fact, if <f>(x, y, z) denotes the latter, then by 
definition we must have, for any value of A whatever, 



<f>{\x, \y, As) = <f>(x, y, z); 
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and therefore, putting A 

x 

<t>h, y , -) = H x > y* z); 

\ X X/ 

<f> is therefore in effect an arbitrary function F of ^, 5 



x x 



We have thus found for a particular case (which can obviously 
be generalized) Euler's well-known theorem on homogeneous 
functions. 

(2) Take the equation 



df df df 
dx dy dz 



x y 
a b 



= 0, 



where a, 6, c are constants which are not all zero. Putting 

x y\ 



X 



y z 




Z X 










\b c 




c a 





a b 



the equation may be written in the form 

ox dy dz 

and the corresponding system of ordinary differential equations 
is 

dx dy dz 



(17) 



Taking a as independent variable, we shall have to integrate 
the two equations 

dy = Y dz Z 
dx X' dx X 



But we can find two independent integrals more easily by 
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another method. In fact, from the form of the equations defining 
X, Y 9 Z 9 we see that the equations 

xX+yY+zZ = 0) 

aX + bY + cZ = 0 j ' ' * - * ' 

are satisfied, since the given determinant vanishes if the elements 
of the first row are replaced by x, y, z, or by <z, b, c; expanding, 
we find that the equations (18) are identically true. 

Now if dx 9 dy, dz satisfy equations (17), i.e. are proportional 
to X, Y, Z, we can substitute them for X, Y, Z in equations 
(18), so getting 

xdx + ydy + zdz — 0, 
adx + bdy -|- cdz — 0. 

The left-hand side of each of these equations is an exact 
differential; hence, integrating, we get as a necessary consequence 
of equations (17) 

# 2 H- ?r + z 2 - c A , 
ax -\- by ~|- cz -~ c 2 . 

These are two particular integrals of the system, which are 
certainly independent, since at least one of the coefficients a 9 b 9 c, 
is not zero. The general integral is thus 

F(x 2 -f y 2 + z 2 , ax + by + cz). 

Geometrical interpretation. — When there are three (or two) 
variables, the foregoing discussion can be given a geometrical 
interpretation in ordinary space (or in a plane). For this purpose, 
let any integral f(x, y, z) of an equation 

(«) X*l + Z*/-0 
ox dy dz 

be considered as the parameter of a family of surfaces f = con- 
stant. By a suitable choice of the constant on the right-hand 
side of this equation, we can make one surface of the family 
pass through a point P arbitrarily chosen in advance; it is plainly 
only necessary to give this constant the value of / at the chosen 
point P. The equation (a) which / must satisfy expresses the 
geometrical fact that at any point P the normal to that surface 
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of the family which passes through P is normal also to the direc- 
tion of the vector (X, Y, Z), given as a function of position. 
The system of equations associated with equation (a), namely 

w X ~ y - z' 

which expresses the relation of two of the variables to the third, 
represents on the other hand a property of certain curves. We 
know from the existence theorem that we can find (and in only 
one way) two functions x — x(z), y y(z) which satisfy the 
system of equations (6), and which take values x 0 , y 0 , arbitrarily 
fixed in advance, when z has the value which is also arbitrary; 
hence we can state that, given any point P, there passes through 
it one and only one curve which has the property expressed by 
the equations (6), i.e. that of being at every point in the direction 
of the vector ( X, Y, Z). An aggregate of curves such that one 
and only one of them passes through every point of a given field 
is called a congruence. 

There is a very simple relation between the family of surfaces 
which represent the integrals of equation (a) and the congruence 
of curves which represent the solutions of the system of equations 
(6), namely, that each curve of the congruence lies ivholly on a surface 
of the family. In fact, consider a point P(x, y, z), and let L be 
the curve of the congruence, and S the surface of the family 
considered, which pass respectively through P. We shall show 
that a point which undergoes an infinitely small displacement 
along L, from P to a point P\ does not leave the surface, or in 
other words that if the equation of the surface S,f(x, y, z) = C, 
is satisfied by the co-ordinates x, y, z of P, it is also satisfied by 
the co-ordinates x + dx, y + dy, z + dz 9 of P\ The result is 
obvious, since for the co-ordinates of P' / becomes 

f(x, y,z)+ df dx+l f dy + V dz, 
ox dy cz 

and as dx, dy, dz, are by equations (6) proportional to X, Y, Z, 
the increment of / is proportional to 

cx ay dz 
which vanishes by equation (a). 
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The surfaces considered are therefore formed by curves of 
the congruence; these curves are called their characteristics. 

We can of course avoid the use of infinitesimal displacements 
in proving the property that every curve of the congruence lies 
wholly on a surface / = constant. The argument, which is 
essentially the same as before, will be as follows. Let the vari- 
ables x, y, z, in the expression f(x, y, z) be considered as the co- 
ordinates of points of a curve L of the congruence, and let A 
be any parameter, e.g. the arc of the curve, which fixes definitely 
the position of a point on L; x, y, z, are thus considered as definite 
functions of A. Substituting these functions of A for x 9 y 9 z in 
the expression f(x, y, z) 9 we shall see that the result is independent 

of A, or that ^/ ~ 0. We have in fact 
dX 

d l = d l dx + H d y + d J dz - 

dX dx dX dy dX dz dX* 

but by equations (6), dd . d ^ along L, are proportional to 

a A a A dX 

X, y, Z, and /satisfies equation (a); hence / = 0. The fact 

dX 

that / remains constant along L is the algebraical equivalent of 
the geometrical property that the curve L belongs to a surface 
/ = constant. 

6. Integrals of a total differential system, and the associated 
system of partial differential equations which determines them. 

Starting from a system of ordinary differential equations, 
we have succeeded in integrating the most general linear homo- 
geneous partial differential equation of the first order. By an 
analogous procedure, starting from a system of total differential 
equations, we shall succeed in integrating the most general 
system of linear homogeneous partial differential equations of the 
first order. 

Consider the system of equations 

du a = XiX^dXi (a = 1, 2, . . . m). . (19) 



We shall apply the term integral of this system to every 
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function f(x \ u) which is such that it reduces to a constant when 
the m's are replaced by any solution of the equations (19). 
Differentiating / we get 

and, if the it's are solutions of the equations (19), 
df= ZJ^ + ^^X^dx, 

The necessary and sufficient condition for the vanishing of 
this differential, whatever the dx's may be, is evidently that the 
n equations 

l S + X a[i = 0 (i = 1, 2, . • . n) . (20) 

cXi i 3w a 

shall be satisfied. They must be satisfied not only when the 
m's are solutions of the equations (19), as is clear from what has 
already been said, but also identically, which can be seen in the 
same way as for the corresponding result in § 2 (p. 36). 
Introducing the linear operators 

ft, = A (21) 

i ou a 

B, - -I- Q ; (i 1, 2, . . . n),. . (22) 

the equations (20) may be written in the form 

B t f ---= 0 (i ---- 1,2,... n). . . (20') 

The system of equations (20) or (20') is said to be associated 
with the system of equations (19); the necessary and sufficient 
condition that / may bo an integral of the system of equations 
(19) is that it should satisfy the associated system of partial 
differential equations (20), or, in other words, that it should be 
an integral of the associated system. 

7. Principal integrals, as typical cases of independent integrals. 

Suppose that the system of equations (19) is completely 
integrable, and let us assign fixed arbitrary values to the con- 
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stents £C° and tp in the field in which the X's are regular; then 
we know that there exist m functions, 

Ua = <f>a(*\u°) (a = 1, 2, . . . m), 

regular in the region round the values afl 9 u° 9 which satisfy the 
equations (19), and which become respectively equal to the 
assigned constants when x h — x". The equations so written 
are soluble with respect to the quantities u° in a region round 
the point x°, as can be shown by means of the same arguments 
as those of § 3 (p. 38). Suppose them solved; we can then express 
the v° R in terms of the x 9 s and the w's, and we shall write 

w a (x\ u) = u° a . 

The w*8 are evidently integrals of the system of total differential 
equations (19), and are therefore also integrals of the system of 
partial differential equations (20); we shall now show that they 
are independent. 

Consider the functional matrix of the w 9 s with respect to the 
x'b and the w's; we shall have to show that its characteristic is 
m, or, which comes to the same thing (since it contains no deter- 
minant's of order > m), that it contains a determinant of order 
m which is not zero. Now the determinant 

/w ± w 2 . . . w m \ 
\u ± u 2 . . . uj 

becomes ~ 1 when x t — x", and therefore is different from 
zero in a region round that point; hence the required result 
follows. 

The m independent integrals w are called principal integrals 
of the total differential system (19), or of the partial differential 
system (20), corresponding to the values x° { of the independent 
variables x. We have thus shown that, with the hypothesis that 
the system of equations (19) is completely integrable, the system 
of equations (20) (or (20') ) admits of m independent integrals, 
which can be determined in an infinite number of ways; namely, 
the principal integrals just considered, which in general vary 
with the choice of the initial values aP. 

Here too, as on p. 40, we shall say more generally that the 
m integrals v l9 v 2i . . . v m of the system (20) are independent 

< D 0ft5 ) 3 
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if the functions v(x \ u) of the n + m variables x and u are 
independent. 

8. The general integral. 

By a property already noted of linear operators, if we con- 
struct any function whatever of the u's, 

F(v v v 29 . . . v ul ), .... (23) 

we get a new integral of the system of partial differential equations 
(20). In addition, for the system of equations (20), as before for 
the single equation (7), the most general function which satisfies 
the system is included in the expression (23); or this expression, 
when F is considered as an arbitrary function, constitutes the 
general integral of the system. 

To prove this, let f(x \ u) denote any integral of the system 
(20), and consider the functional matrix of the ,n + 1 functions 
(of m + n variables) v v v z . *. . v m , f: 





dv 1 


dv x 


dv 1 


dx 1 


dx n 


du x 












dx L 


dx a 








?f 


df 


df 


dx 1 









If we can show that the characteristic of this matrix is 
it will follow that there exists (m + 1) — w* t or 1 , relation between 
the m + 1 functions which does not involve the #'s or the u'a 
(cf. § 7, pp. 9-12). This relation must necessarily contain / ex- 
plicitly, since there can be no relation connecting the v& alone. 
We can therefore solve it for/, which will have the form (23), 
so giving the required result. 

We shall first make a slight change in the form of the matrix 
M y by making it contain only derivatives with respect to the 
w's. This is easily done, for since 



Bp m = 0, BJ = 0, 
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we get from formula (22) 

OX; OX; 

The matrix thus becomes 



M = 



























9^ 





"1/ 



To prove that the characteristic is m, we have to prove: 

(1) That every determinant of order m + 1 (the highest 
order possible) vanishes; 

(2) That at least one determinant of order m does not vanish. 
A generic determinant of order m + 1 will be formed by 

taking all the m + 1 rows, and m + 1 columns chosen arbitrarily 
from the m + n of the matrix. These m + n columns are of two 
types: the first n contain the operators Q, the remaining m the 
9 

operators ; let r columns be taken of the first type, and s of 

du 

the second, with of course r + s = m + 1. Now in order to 
write down in a perfectly general form a row of this determinant, 
which will contain either the v\s or (if it is the last row) /, we shall 
use the symbol <f> to denote either one of the v's or /; we can then 
write the row as follows: 

d(f> d<f> d<f> 



du t du,. 



where the suffixes h v h 2i . . . h r constitute any arrangement of 
r numbers, chosen from 1 to n, and the suffixes Jt^, h 2 , . . . k s any 
arrangement of s numbers, chosen from ] to m. Bemembering 
the definition of £2 given in (21), we see that each of the first r 
elements of the general row is a linear combination of the other 
elements; or, as we usually say, that the first r columns of the 
determinant are linear combinations of the other columns. The 
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determinant can thus be broken up into a linear combination of 
determinants of order m + 1 in which all the columns are of the 

second type ^i.e. are composed of terms But there are in 

all only m columns of the second type; it is therefore impossible 
to choose m + I of them without repeating at least one. It 
follows that in each of these partial determinants there are at 
least two columns equal, and therefore these determinants all 
vanish. The general determinant of order m + 1> which is a 
linear combination of them, must therefore also vanish. This 
proves the first of the required propositions. 

The existence of a non-vanishing determinant of order m 
is a direct consequence of the hypothesis that the integrals v 
are independent. 

AVe have therefore proved completely that the characteristic 
of M is m, and therefore that / can be expressed in terms of the 
independent integrals i.e. that / has the form given in (23). 

9. Direct study of the most general system of linear homo- 
geneous partial differential equations of the first order. Complete 
systems. Jacobian systems. 

Let us consider a generic system of n linear homogeneous 
partial differential equations of the first order, in N variables, 
and with only one unknown function: 

A k f - E F a^ - 0 (A = 1, 2, . . . n). . (24) 

1 vZ v 

We shall suppose that these n equations arc independent, 
and we can therefore assume n < N. In fact, if n > N, the 
equations, which we have supposed independent, considered as 

algebraic equations in the N quantities would be mutually 

°** df 
inconsistent; and if n = N, this would imply that y = 0, 

or/ — constant. Further, it is clear that every / which satisfies 
equations (24) must necessarily also satisfy the following \n(n— 1) 
equations (obtained by constructing all possible Poisson's paren- 
theses with the given operators): 

(A h , A k )f - 0 (h, h = 1, 2, . . . n). . (25) 
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These ore differential consequences of the given system. Since 
derivatives of the second order disappear from equations (25), 
it may happen that these equations, or some of them, are also 
algebraic consequences of the system, i.e. that they can be obtained 
algebraically by taking a linear combination of the n given 
equations. 

If all the equations (25) are algebraic consequences of the 
system of equations (24), this system is called complete. 

In the opposite case, consider the system formed by adding 
to (24) those of (25) which, together with (24), are linearly inde- 
pendent. The new system will be equivalent to the original one, 
and will contain one or more additional equations. Repeating 
the same procedure for the new system, and so on, we shall reach 
either a complete system or else a system in which the number 
of equations is equal to or greater than N — the case of mutual 
inconsistency, as already noted at the beginning of this section. 

We need therefore only consider complete systems. The 
condition of completeness can be written in the following form: 

(A h9 A ic )f^ ZiPmAif, • • ■ (26) 
1 

where the coefficients p denote functions (a priori of any form 
whatever) of the independent variables z. From the definition, 
and applying identity (5), it follows that the coefficients p satisfy 
the relations 

Vuu — —Pm *,!==■ 1, 2, . . . n). 

A particular case — of special importance — of a complete 
system is that in which all Poisson's parentheses are identically 
zero (i.e. all the coefficients p are zero); when this is so the system 
is called Jacobian. 

10. Equivalence of every complete system to a Jacobian system 
with the same number of equations. Note on Cramer's rule. 

We propose to show that a complete system can always be 
replaced by a Jacobian system with the same number of equations; 
thus the consideration of any complete system can be reduced 
to that of a Jacobian system. 

Starting from the system of equations (24), we shall suppose 
that it is complete; i.e. that the equations (26) are satisfied. We 
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shall adopt the following procedure: we shall construct « distinct 
linear combinations of the n given equations, 



with the condition 

tit II 4= 0 



i 



(i = 1, 2, . . . n), 



(27) 



and we shall choose the coefficients c in such a way that the 
system (27), which is equivalent to the given system, may be 
Jacobian. 

Before doing this, however, we shall write the given equations 
in a slightly different form. We know that the matrix of the 
a's has its characteristic equal to n (sinr:e the equations are inde- 
pendent); let us arrange the variables in such an order that the 
determinant a formed by taking the first n columns of the matrix 
may be that which does not vanish (or one of those which do 
not): 



a = 



a 21 



a 0 



*w2 



a„ 



4= 0. 



We shall next divide the variables z into two groups: we shall 

and the remaining N — n 



call the first n of them a^, x 2 , 
— m we shall call v lt u 2 , . . . u m . With this notation, the given 
system can be written in the form 



ka>« d / +U k f 



0 



(k — 1, 2, . . • ri), 



where U k denotes an operator involving only derivatives 
with respect to the ^'s, the explicit expression of which does 
not for the moment concern us. Now solve 1 these n equa- 

1 Cramer's well-known ruk* may be put in the following form, which we shall 
frequently use, here and elweu here. 

Let there lie given n linear equations 



2„«„*t;„ = V k {k = 1, 2, 



n), 



(a) 



such that the determinant a of their coefficients is not zero. 

We shall denote by a r * the reciprocal element of the generic element Or 9 of the 
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tions with respect to the terms Putting them in the form 

o x v 

k ** I s = - u k f, 

multiply each equation by the corresponding a ik (the reciprocal 
element of a ik in the determinant of the a's) and sum with respect 
to k from 1 to n* We thus get n linear combinations: 

^L = -i k a fk U k f (t = 1,2, ...n), 

which are independent, since by a well-known result the deter- 
minant of the coefficients a lk is equal to \ and therefore is not 

a 

zero. These equations can be written in the more concise form 

l f + L1J - 0 (t «= 1, 2, . . . n), . (24') 

o x t 

where the C2 <: 's represent linear operators containing, like the ETs, 

determinant a; i.e. the algebraic complement (or minor) of a ra divided by a. 
Then, applying two ordinary theorems on determinants, and indicating by either 
zero or unity, according as r =J= # or r = s r we get 

Z k a ik oJ k =z (0) 

1 * 

l k a ki aV = 5^ (?) 

1 z 

Applying these properties, the equations (a) can be solved by constructing 
suitable linear combinations of them. For instance, to find £4, multiply the kth 
equation by a lk ; then giving k all values from 1 to n, and summing, we get 

n n n 
2 k a*2 v a vk Z y = Z k a l * % 

The left-hand side of this equation can be transformed as follows ; 
hence the solution is given by the formula : 

= s****, ■ («') 

1 
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only derivatives with respect to the u% and therefore of the 
form 

d 



The system of equations (24') is equivalent to the original 
system (24). The only formal simplification is the specially 

?) 

simple way in which the terms in occur. But we shall show 

ox 

that the system (24') has the advantage of being both complete 
and Jacobian; it therefore constitutes precisely the system we 
are in search of, containing n linear combinations which we 
have denoted in equations (27) by the operators B t ; the co- 
efficients c lk of (27) will be identical with the coefficients a ik . 

We shall first show that the system (24') is complete. We can 
write it shortly in the form 

BJ = 0 (24") 

where 

+ • <28> 

Since the operators B are linear combinations of the A's, it 
follows from a theorem proved above on p. 35 that Poisson's 
parentheses (B t , Bj)f are linear combinations of the expressions 

^kf> (A fi , A k )f. 

Now since the system (24) is complete, it follows that the 
expressions (A h , A, r )f are in their turn linear combinations of 
the expressions Af; so that the operators (B t , Bj) are seen to be 
linear combinations of the ^4's alone. But the A 9 b are linear 
combinations of the iJ's (since the JS's are independent com- 
binations of the A'r)\ hence ultimately the operators (B t9 Bj) 
are linear combinations of the ZTs. In other words, the system 
(24") also is itself complete. 

We can therefore write 

(B if £,)/ = XrfifiBtf, .... (29) 



where the coefficients y are analogous to the p's of formula (26). 
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To show that the system (24") is Jacobian, we must prove 
that all the coefficients q vanish. We note that both sides of 

equation (29) are linear in the terms and the identity cannot 

hold unless the coefficients of the same derivative, e.g. ^ , are 

dx h 

the same on both sides. We proceed to find these coefficients. 
The left-hand side of (29) can be written in the form 

We saw on p. 35 that the result contains no second-order 
derivatives; it is therefore unnecessary to apply the operator B 
to the derivatives of /, so that the expression in question reduces 
to 



(30) 



As this contains no terms in — , it follows that the coefficient 
df 

of every - - - is zero. On the right-hand side the coefficient of the 

corresponding term is q, Jh (remembering the definition of B)\ 
hence every q — 0, and in consequence 

(B l9 Bj)f ----- 0, 

or the system (24') is Jacobian. 

A further remark which will shortly be useful is that from 
the vanishing identically of each side of equation (29) and from 

equation (30) it follows that the coefficients of the terms in 
also vanish, or from (30), u 

BiXaU — BjX a \i — 0 (31) 

11. Integration by means of the associated system. 

Gathering up the foregoing results, we now see that, given a 
system of linear homogeneous partial differential equations of 
the first order, we can find its general integral— if one exists — 
by means of the integration of a complete system of total differ- 
ential equations. 
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We have seen how to transform the given system into a com- 
plete system (if it is not so already, and provided it contains no 
inconsistency). We now note that the Jacobian system (24') 
which we reached as a result of transforming the generic complete 
system (24) for other purposes, is identical with the system (20), 
which originally arose as the system associated with a generic 
system of total differential equations. The important point here 
is that if with the coefficients X belonging to the system (24') 
we construct the system of total differential equations (19), this 
system is completely integrable. 

In fact, the condition for this is that 

= **!\J = 1, 2, . . . n; a = 1, 2, . . . m), 

and remembering the definition (28) of the operators B, these 
can be written shortly in the form 

The equations (31) show that the X 9 s obtained from the 
system (24) satisfy these conditions. 

Having transformed the given system into the form (24'), 
we need therefore only construct the associated system (19) 
and integrate by the method given in the preceding chapter: 
the most general solution will be obtained in the form 

u a = <f> a (x | w°) (a = 1, 2, . . . m). 

Solving these m equations with respect to the u 0t &, we get 
m = N — n principal integrals, and constructing any function 
whatever of these integrals we have the general integral of the 
given system. 

This systematic method of integration is in theory quite 
general and covers all possible cases, but it is somewhat laborious 
to apply. In practice it is often shorter to integrate the equations 
separately, and then to look for the m common integrals which 
certainly exist, when we have ascertained beforehand that we 
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are dealing with a complete system. The following may be given 
as an example. 

Consider the system 



Af -~- a, V + x*l + + x*l = 0, 

vx : dx 2 dx 3 dXi 

5/ - x 2 + x, ^ — as 4 + x,, J - = 0. 
oil vx s ox 3 CXi 



(32) 



We shall first show that it is not only complete, but also 
Jacobian. To show this as shortly as possible, we put 

3^ 3a? 2 3^ 3# 2 



dx 4 d# 3 3cc 4 



so that A A x -\- A 2 , B — B x -f 2? 2 , and then construct 
the alternate function of the two given operators. We get by 
successive transformations 

(A, B)S- = A Rf — BA f 

= A^BJ + A Z BJ + A^J + A 2 BJ 

— B X AJ— BjA 2 S— B 2 AJ~ B 2 AJ 
= (A v B 1 )S+ (A 2 , BJJ+ (A v B 2 )f+ (A 2> B z )f. 

Now it can be shown directly that 

(A lt BJS = 0, (A Z) BJS = 0, 

and interchanging x v x 2 , and x 3 , a? 4 , it follows that 

(A 2 , B 2 )f - 0, (A l9 B 2 )f = 0. 

Hence 

{A, B)f = 0, 

which means that the system is Jacobian. It will therefore have 
4 — 2 ~- 2 independent integrals, or rather (cf. p. 40) an infinite 
number of pairs of such integrals. 

To find one such pair, note that the first equation (which is 
of the type considered in the example on p. 43) has as its general 
integral any homogeneous function of degree zero in the variables 
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x v # a> x 3 , a? 4 , Wc need therefore only find two independent 
integrals of the second equation which are homogeneous of 
degree zero. 

Now the system of ordinary differential equations associated 
with the second of the equations (82) is 

dx x dx 2 dx$ __ dx A 

x 2 x x x 4 x* 3 

The equation formed of the first two of these terms can be 
integrated immediately, and gives 

x* + x/ « a 2 ; (33a) 

similarly the other two terms give 

* 3 2 + *4 2 ^ 6 2 (336) 

where a and b denote constants. 

Equating the first and third terms, after substituting in them 
for x 2 and x 4 the expressions given by equations (33a) and (336), 
we get 

dx x dx§ 
a 2 — Xj 2 *J 6 2 — a; 3 2 

and therefore integrating 

_ 1 X\ . _ 1 X>t 

sm 1 1 — sin 1 / ^ c, 
a b 

where c is a third constant. 

This last integral can be put in the form 

sin 1 - X K -sin" 1 = c . (33c) 

V x* + x 2 2 V x 3 2 -f x 4 2 



We also get from (33a) and (336) 

x* + x 2 * = a 2 
x 3 2 + # 4 2 6 2 



(3M) 



Of the four integrals thus found, the last two, (33c) and 
(33rf), are homogeneous of degree zero, and are therefore also 
integrals of the first equation; and it would be easy to verify 
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that they are independent. Hence the general integral of the 
system of equations (32) is 

/(sin" 1 ~ . - sin" 1 , .*iLL^V 

' \ s/x* + X* Jx* + X? Xj + « 4 V 

where / is the symbol of an arbitrary function. 

CHAPTER IV 

Algebraic Foundations of the Absolute 
Differential Calculus 

1. Effect on some analytical entities of a change o£ variables. 

This chapter is devoted to the study of the effect on some 
analytical entities of a change of variables. In this first section 
we propose to give some examples showing the nature of the 
general considerations which will be subsequently established. 

Consider n independent variables x l9 x 2> . . . x n9 which we shall 
as usual denote collectively by x 9 and suppose a transformation 
applied to them which leads to another set of n independent 
variables x ; it is understood that the transformation used is 
reversible, i.e. that the transformation formula* 

x g = x t (x) (i = 1, 2 ») . . . (i) 

can be solved for the x's in the field considered, so that we have 
simultaneously the equivalent equations 

*< - (!') 

The geometrical name for this operation is of course change 
of co-ordinates, to fix the ideas, we may take n = 3, so that we 
are passing from Cartesian orthogonal co-ordinates x, y, z to 
three generic independent combinations of them (curvilinear co- 
ordinates) q l9 q 2 , q 3 . 

Now suppose that in dealing with a physical, geometrical, 
or other question we find that we have to consider riot only the 
variables x 9 but a certain aggregate of entities connected with 
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them. For instance, in a certain region of physical space referred 
to Cartesian co-ordinates x, y y z, let the temperature T be defined 
at every point; then it is a determinate function of x, y, z. Or 
we may suppose that a field of force exists in the given region, 
and we shall then have to consider at every point a vector, and 
hence its components, i.e. three functions X, Y, Z of x y y, z. 
Now change the variables. We have to find some way of expressing 
the same quantity or physical phenomenon (temperature, force, 
&c); for this purpose we find that we have to introduce certain 
parameters which in the new system of reference will with advan- 
tage take the place of those which were more suitable when we 
were using Cartesian co-ordinates. These new parameters are 
naturally called transforms of the original ones; they are obtained 
from them by a law which cannot be assigned a priori, but depends 
on the nature of the problem, and in part on suitable conventions. 
For instance, in the new system the temperature T will be a 
function of g a , q 2 , <y ; j, such that the same temperature belongs 
to the same point of space, whether the calculations are made 
with the original or with the new variables; hence T as a function 
of the g's will be obtained by substituting for x, y, z in T(x, y, z) 
their values in terms of q v gr 2 , y 3 . This kind of behaviour, which 
is the simplest we shall have to consider, is called transformation 
by invariant, all functions of position which have a value inde- 
pendent of the system of co-ordinates chosen are transformed in 
this way. 

With the components of a vector, in the other example cited, 
this does not happen. If in fact, as we may suppose, the vector 
has a magnitude and a direction which are independent of the 
system of co-ordinates chosen (we shall think of it as being 
defined physically as a force), its components, on the contrary, 
even when the point considered remains unchanged, change their 
values when the frame of reference is changed. This is obvious 
in the case of a rotation of Cartesian axes. If, however, the trans- 
formation considered is not of this particular kind, we do not 
know a priori what to substitute for the projections X, Y, Z 
of the vector on the axes of the old system in order to specify 
the vector in the new system; 1 i.e. we have to determine the law 

1 We shall see in various parts of Chapter V how the introduction of new 
variables q^ y a , g 9 gives rise geometrically to cor responding co ordinate surfaces 
qi = constant, q% — constant, q s = constant, and co-ordinate lines which are their 
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of transformation which will meet the needs of the case in ques- 
tion. The most suitable criterion to take as a guide in making 
our choice is found by introducing, alongside the given vector, a 
scalar quantity with a physical significance which is transformed 
by invariance. In this case we take two infinitely near points 
whose co-ordinates differ by dx, dy, dz\ then the work of the 
force whose components are X f Y, Z, in passing from one of 
these points to the other, will be 

dW Xdx + Ydy + Zdz; ... (2) 

this scalar quantity has a physical significance which is invariant, 
and it can therefore be concretely determined. From the mathe- 
matical point of view it is an important fact that with any system 
of orthogonal axes Oxyz the Cartesian components of the force 
are identical with the coefficients of dx, dy, dz in this expression. 
Changing to the curvilinear co-ordinates q l9 q 2 , q Si we can find 
the resulting values of dx, dy, dz by means of the differentials 
of the new variables, using the formulae 

dx = S, °* dq t) &c. 
The work dW will take the form 

1 \dq, dq t dq, / 

which is analogous to formula (2). 
In fact, putting 

dx x dy y + d ±z = q {i = h 2j 3) (3) 
dq t dq,- dq t 

we get dW — Q x dq x + Q 2 <kz + Qs <kz> 

intersections. Bearing this in mind, if wo proposed to use geometrical criteria 
taken from our co-ordinate system in order to specify the elements which deter- 
mine a vector, we should find ourselves faced by four possibilities, all equally 
acceptable, and with one or another preferable according to circumstances. At 
every point, in fact, the tangents to the co-ordinate lines and the normals to the 
co-ordinate surfaces form two supplementary trihedra, which ar« in general 
oblique-angled, and therefore distinct; and a vector may be defined either by 
its orthogonal projections on, or by its components along, either of these two 
trihedra. 
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The quantities Q }9 Q 2 , Q 9 here hold the same position as did 
X, Y, Z in Cartesian co-ordinates; it therefore seems suitable 
to call them the components of the force in the new system of 
reference, so that we may say that formula (3) represents the 
law of transformation of the components of a vector. This law 
is called comriance. 

AVe can also reach this law from a different point of view, 
which, however, we shall show in a moment to be really a particular 
case of the preceding argument. Consider an invariant function 
u(x, y, z)\ we shall try to find the most convenient law of trans- 
formation of its three derivatives ^ U 9 ^ U which are evidently 

ox dy dz 

functions of x, y, z. A natural course is to consider the three 

derivatives ^ as being the expressions which correspond 

dq x dt h dq. s 

to them in the new system of reference; these are of course given 
by the ordinary formulae 

du du dx . du dy , du dz t £ . ov /AX 

, - + a~ ' ' + T - (t = 1, 2, 3). (4) 

dq t dx dq t By dq, dz dq L 

If instead we were to assume transformation by in variance, 
the three quantities we are considering would represent deri- 
vatives of a function only in the original Cartesian system of 
reference, while in any other they would in general lose this 
special property. 

The formula* (4) are evidently a particular case of the formulae 
(3), in which the derivatives of a single function u have been 
substituted for the components of the generic vector. The real 
reason for this is found in the fact that the law of persistence 
of the derivatives can also be included as a special case of the 
in variance of a linear differential form. As a particular case, we 
need merely replace dW (which is not in general an exact differ- 
ential) by the total differential du, which may be expressed in 
either of the two forms 

du j , du j . du j 
_ a x f - - dy + - dz 
ax dy dz 

j du , . du j , du j 

and a dq x + dq 2 + dq z . 

dq x dq% dy* 
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The foregoing remarks will suggest what it is we propose 
to do, though naturally this will become clearer as we 
proceed. 

Given in a certain system of reference (which may be of any 
kind whatever) a set of quantities having a certain significance, 
physical, geometrical, or other, we assign a law of transforma- 
tion by means of which a set of quantities having the same 
significance is associated with any other system of reference, 
and we are thus led to introduce a set of parameters, collectively 
independent of the system of reference, whether Cartesian or not. 
This is the basis of the conceptual importance and the fertility 
of the considerations which we propose to develop. 

2. m-fold systems. Forms of degree m and m-ply linear 
forms. 

We shall first define a system of order m or m-fold system. 
We apply the term to a system of numbers 

A . 

which are such that a one-to-one correspondence with a specific 
law exists between them and the set of m integers i v i 2 , . . . i m , 
where each of the z's can take all integral values from 1 to n. 
The number of elements of an m-fold system is thus n u \ this 
being the number of permutations (with repetitions) of n numbers 
taken m at a time. It is not necessary that these n ,u elements 
should be all different. 

A system composed of a single number (which may be repre- 
sented by a letter without a suffix) may be considered as a system 
of order zero. A simple (one-fold) system will be the aggregate 
of n elements which can be represented by the notation 

A, (i = 1,2,... n); 

e.g. the set of three components of a vector, for which m — 1, 
n = 3. 

A double (2-fold) system will be of the type 
A 0 {% j r-= 1, 2, . . . n), 

and will consist of n 2 elements; and so on. 

A system of order greater than 1 is called symmetrical if all 
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the elements in it which differ only as to the order of their suffixes 
have the same value; e.g. for the case m — 2, if A jt = Aq. 
A system is called antisymmelrical (skew) if when two suffixes 
are interchanged the element changes its sign but not its value; 
again for the case m — 2, if A tJ = — A jt . The n coefficients 
u of a generic linear form 1 

n 

<f> — S t u t Xi 
i 

constitute a simple system, which is in fact the most general of 
its kind, since, given n quantities u 4 , it is evidently always possible 
to consider them as being the coefficients of a linear form <j>. 
Consider next a quadratic form, which we may write as 

as the sum includes all permutations of the suffixes two at a time, 
the product of x k and x ) will occur twice, once as x t x f and once 
a? XjX i9 so that the coefficient of the product is A n + Aj . This 
is unchanged if i and j are interchanged; hence we see that the 
coefficients of a quadratic form constitute a symmetrical double 
system, which is the most general possible. But if we wish to 
determine a generic (non-symmetrical) double system by means 
of the coefficients of a form, a quadric in the independent vari- 
ables x is no longer sufficient. We shall now require two different 
w-fold systems of independent variables, e.g. the co-ordinates 
x and x r of two points between which no a pricrri relation exists, 
and we must construct the expression (bilinear form) 

it 

which is linear in both the x'a and the #"s; the coefficients of this 
form are the required arbitrary quantities A iy 

More generally, it is easy to see that a generic m-fold system 
is determined by a multilinear form of m groups of variables, 
while the coefficients of a form of degree m constitute the most 
general symmetrical m-fold system. 

1 The term form with respect to given arguments (e.g. the independent variables 
set, aca, . . . x n ) means a polynomial homogeneous in those arguments. 



ALGEBRAIC FOUNDATIONS 



67 



3. Invariance, covariance, and contravariance of a simple 
system with respect to linear transformations. Dual variables. 

We now proceed to examine the laws of transformation of 
systems. We shall at first limit our investigation to a linear 
change of variables and a simple system u l9 u 2 , . . . u n . 

We shall suppose that we can pass from the variables 
x to the new variables x, and vice versa, by means of the 
formulae 

n 

x, = E* c lk x k (i = 1,2,... n), . . (5) 
1 

S t - h^'x, (i - 1, 2, ...«), . . (5') 

1 

where the coefficients c are arbitrary constants whose determinant 
is not zero; the second formula follows from the first by applying 
Cramer's rule, so that c h ' is the reciprocal element of c kt (cf. 
p. 54, footnote). 

The most obvious hypothesis to make is that the u'& are 
functions of position which are transformed by invariance 
(cf. § 1). 

We get a slightly less simple, but remarkable, case if we sup- 
pose that the us are transformed by the same law as the co- 
ordinates, in which case the w's will be called contravarianls. In 
particular, the co-ordinates themselves form a contravariant 
simple system. 

Next suppose that the u'& are the coefficients of a linear 
form 

n 

<f> = S f u, x„ 
1 

and that <f> is transformed by invariance, i.e. by substituting for 
the x's the expressions (5), so that <f> is also a linear form of the 
new variables x. We shall take the coefficients of this new form 
as the transforms u of the w's; we shall then say that the u's 
form a cmariant system. 

Writing out the expressions in full, we have 
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4> = E; W;E ; . c tk x k = T, ik c ik u t x k = E* x k S, u t . 
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The new coefficients are therefore 

n 

u k = S £ c^w £ . 
i 

Interchanging i and 7j, so as to get the formute in the same 
shape as (5'), we get 

n 

Hi = Z> k c ki u k (i = 1, 2, . . . n), 
i 

which gives the law of covariance. 

Here, too, we naturally add the equivalent formulee, which 
are obtained by solving for the original elements u> and are given 
by the usual formula (Cramer's rule). Writing them first, so 
that we get them in the order corresponding to that of (5) and 
(5'), we have finally the law of covariance expressed by the two 
groups of equivalent formulee 

«, = i k c*u k , (6) 

i 

n 

u t = *L k c H u k (i = 1, 2, . . . w). . (6') 
i 

We shall frequently consider, together with the variables x 
(which are also called point variables), a system of co variant 
variables u (called dual variables); the behaviour of both sets 
of variables when a linear change of variables is made is shown 
by formulae (5) and (6). 

To find a geometrical interpretation of dual variables, we may 
fix our attention on the case n — 4, in which x l9 x 2 , # 3 , a? 4 can 
be considered as homogeneous Cartesian co-ordinates of the 
points of space. A plane has an equation of the type 

u x x ± + u 2 x 2 -f u. s x 3 + w 4 x 4 = 0, . . (7) 

where with the usual terminology, the coefficients u 2y u A 
are Plticker's co-ordinates of the plane. Now, given the geometrical 
significance of equation (7), its left-hand side must be invariant 
(except for a non-essential factor, the co-ordinates being homo- 
geneous), and hence the Plucker's co-ordinates u must be trans- 
formable by covariance. From the well-known law of duality of 
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projective geometry the v?& have been given the name of dual 
variables. Analogous results hold for any value of n. 

4. Inv&rianoe, covariance, and contravariance of an m-fold 
system with respect to linear transformations. Mixed systems or 
tensors. Vanishing of a tensor an invariant property. 

We shall now extend the discussion of the preceding section 
to systems of any order, but still limiting it to the case of linear 
transformations of the type (5), (5'). We thus define mixed 
systems, of which covariant and contravariant systems are 
particular cases. 

Consider m sets of n point variables (i.e. m points). Denoting 
by an upper index the ordinal number of each point, we get the 
set of arguments 

#1* ■ ' • *^n> 

9 9 • ■ • *"n ■ 

Consider also a certain number /x of sets of n dual variables 

Ml* • • • U n> 

2 2 2 . 

UJ, . . . u n \ 



ttf, lljf, . . . ttj. 

Construct a multilinear form F in all these variables, each 
term of F containing as factor an element taken from every set 
of the m x's and the /x u's. The coefficients of these terms, which 
are a priori completely arbitrary, will constitute a generic system 
of order m H- /x. Writing the indices corresponding to the #'s 
below and those corresponding to the w's above, we shall have 

F -= S. & 3 j A* 1 " Y x) ... x" 1 u] ...u». (8) 

Now transforming the z's by the law of contravariance and 
the u's by the law of covariance, and substituting the expressions 
so obtained in (8) (Le. transforming F by invariance), we shall 
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get a multilinear form of the new variables x, ft; we shall take 
the coefficients A of this new form as being the transforms of 
the coefficients A. We shall then say that the A's constitute 
a tensor or mixed system, cxwariant with respect to the lower 
indices, conlravariant with respect to the upper. In particular, 
m or fM may be zero, leading to the absence from F of the point 
or dual variables respectively; then the system of coefficients is 
purely contravariant if the variables in F are all covariant, and 
vice versa. 

The case of the simple system comes at once under this 
definition. In fact, F in this case becomes the <f> of the preceding 
section; if we consider it as linear in the x's, we find that; the 
coefficients u, according to the definition just given, must be 
called covariants; while if it is considered as linear in the u's, 
we conclude that the x's form a contravariant system, which 
agrees with the definitions already assigned. 

A covariant, contravariant, or mixed tensor, having m + M 
indices in all, is said to be of rank m + P> a simple system, either 
covariant or contravariant (i.e. a tensor of rank 1 ) is also called 
a vector, and its elements arc called respectively covariant or 
contravariant components of the vector. 

Following a similar method to that used in the preceding 
section to find the formula; (6) and (6'), we could find the general 
transformation formula? for mixed systems, and hence, in parti- 
cular, the formula) for contravariant and covariant systems. 
We shall not need these formulae, as in what follows we shall 
always go back directly to the definition just given. As an 
example, however, we propose to find them and give them in 
full for the simplest case of the mixed system, i.e. the system 
with a single index each of covariance and of contra variance. 

Consider therefore the bilinear form 

F = i.jA'xtU;, 

and transform it by invariance. Using formula* (5) and ((>), we 
get 

F = S« A{ c lk x k S A e* u h = Z ljhk A\ c ik c* x k u h 

111 1 

n n 
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The coefficients of this new form are 

Z* = S<,4o*c* (9) 

1 

which gives the law of transformation for mixed systems with 
two indices. We should get similarly, for the most general 
mixed system, 

2i "'Y = k h hK i '' 'k<> kii ...c k ( c''<*...cV*. (10) 

As a memcrria technica, we may add that the transformation 
formulae for the #'s and the u 9 s give an easy way of remembering 
those for a tensor of any kind. The latter are always linear, and 
the coefficients are composed of the c's in a similar way to those 
of (5) and (6): to each index of covariance corresponds a c with 
the indices below, to each index of contravariance a c with the 
indices above. The opposite holds in the inverse formulae. 

We may sum up the discussion so far in the following defini- 
tions. 

An m-FOLD co variant is an m-fold system which is transformed 
in the same way as the coefficients of a multilinear form in point 
variables', an m-FOLD contra variant is one which is transformed 
in the same way as the coefficients of a multilinear form in dual 
variables; more generally, a mixed .system or tensor is one which 
is transformed in the same way as the coefficients of a multilinear 
form in both point and dual variables (including also as particular 
cases both purely covariant and purely contravariant systems). 

The indices of contravariance are generally written above, 
those of covariance below; an exception is however made for 
the variables ,r, which are as usual denoted by # l5 
with the indices below, even if, as in the present case, we are 
dealing with a contravariant system and linear transformations. 

We shall close this section with a remark which is as obvious 
as it is fundamental whenever the notion of a tensor occurs. 
This is the fact that if all the elements of a tensor, with reference 
to a certain system of variables, vanish, this necessarily also 
happens for the transformed elements which correspond to any 
linear change of variables whatever. This is an immediate con- 
sequence of the fact that the hypothesis makes the invariant 
form F vanish identically. 
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5. Symmetrical double systems. 

Since we shall have occasion later on to deal with a remarkable 
symmetrical covariant double system, we propose to give here 
Borne properties of systems of this kind. Let the elements of such 
a system be ^ ^. (U) 

their covariance will be expressed by the fact that the bilinear 
form n 

F{x\x') = Z»o»x,*; . . . (12) 
i 

is invariant in any linear transformation which changes the x'& 
and the a?"s into other sets of variables x, x\ 

We shall first show that such a change of variables leaves 
the symmetry of the system unchanged; i.e. that 

= <h< (13) 

In fact, if we interchange the variables x 9 x' in the bilinear 

form (12), we get M 

F(x' | x) — S lfr a t> x k x\ 
i 

and since the right-hand side of this equation differs from that of 
equation (12) only by the non-essential interchange of the Jotters 
i and k, it follows that 

F(x' | x) = F(x | x 9 ) (IT) 

Vice versa, if this relation holds, wo conclude, by reversing 
the steps of the argument, that (11) is also true. 

Hence the condition of symmetry (1 1) is completely equivalent 
to the condition (11'). From this standpoint it is easily seen to 
be invariant. In fact, changing the variables, and denoting for 
shortness F >x(x) \ x'(x')} 

by F(x J in- 

equation (11') changes to the equality 

F(x' \x) = F(x\ x') 
which, as we have just seen, is equivalent to (13). 
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We could show in the same way that if a contravariant double 
System is symmetrical with respect to one system of co-ordinates, 
it is still symmetrical after any linear change of variables; a 
mixed system aj, however, has not this property. For an artti- 
symmetrical double system, either covariant or contravariant, 
we could also show similarly that antisymmetry is an invariant 
property. 

We can now use the property just illustrated to establish 
the covariance of the coefficients of an invariant quadratic form. 
Let the quadratic form be 

<f>(x) = Zna+z^Xt (14) 

1 

Changing the variables, <f>(x) evidently becomes a quadratic 
form in the x's, which we shall write 

<j>(x) S^a^x* (14') 

i 

We shall show that the coefficients a ik are the transforms by 
covariance of the coefficients a lk9 or in other words are the same 
as would be obtained by changing the variables in F(x \ x'). 
In fact, we get <f>(x) from F(x | x') by first putting x' equal to x, 
or 

4>{x) = F(x\x), 

and from this, with the usual change of variables, we then get 
^(x), which is thus derived from F (x\x') by applying successively 
the two operations 

x i — x i9 (a) 

a, = ^(*) (b) 

But the same result will obviously be obtained if these two 
operations are applied in inverse order, i.e. if we pass first from 
F(x | x') to F(x | x') (the coefficients of which are by definition 
the transforms by covariance of the coefficients a lk ) 9 and then, 
by the operation (a), which implies x' — x and on account of 
symmetry does not change the coefficients, to <f>(x); the co- 
efficients of this last expression are therefore the transforms by 
covariance of the coefficients a^. 



74 



INTRODUCTORY THEORIES 



6. Sets of n covariant and contravariant simple systems. 
Theorem on reciprocal sets. 

We now propose to prove a lemma in which we shall have to 
consider, not a single simple system, but a set of n covariant 
simple systems. We must therefore distinguish the elements in 
question by two indices, one showing the ordinal number of the 
system from which an element is taken, the other (which will 
be an index of covariance or of contravariance) showing the 
ordinal number of the clement in that system. Consider, there- 
fore, the set of n covariant simple systems 

K\i ( a > * = -1, 2, . . . n), . . . (15) 

where a represents the ordinal number of the system and is 
therefore not an index of either covariance or contravariance; 
and suppose further that the determinant of the A\s does not 
vanish, or in other words that the n systems are independent. 
With this hypothesis, to every element A a | £ will correspond a 
reciprocal element (its algebraic complement or minor divided 
by the value of the determinant), which we shall denote by 

A! (a, i = 1, 2, . . . n). . . . (15') 

In a linear change of variables the terms A rt) ,- will be transformed 
by the law of covariance, and the transforms will be denoted by 
K\ii we s hall ta ^ e the reciprocal elements A^ of the terms A a]l 
as representing the transforms of the reciprocal elements A*. 

We shall now show that this law is identical with contra- 
variance, i.e. that giving a the values 1, 2, ... n, the terms A^ 
constitute n contravariant simple systems; or shortly, that the 
reciprocal s^t of n covariant systems is a set of n contravariant 
systems. This is the reason for placing the index i above. 

The hypothesis of covariance of the set of n systems (15) 
means that the n linear forms 

r a = 2, A a|l x t (a = 1, 2, . . . n) 
i 

are invariant. What we have to prove is that the n linear forms 

71 

i/t a = ZtKuf (a = 1, 2, . . . n), 
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are also invariant, i.e. that 

0 a -^ = S,Ali7 i -S ( A>,= O (a=l,2,...»), (16) 

1 1 

whatever the u's may be. Now these last expressions are linear 
in the w's (since the u 9 & are merely linear combinations of the 
w's), so that each of them is of the type 

i 

To show that this vanishes identically (i.e. that all the f's are 
zero) we need only show that it vanishes when we give the u's 
n distinct sets of numerical values, as we shall then have n homo- 
geneous linear equations in the £'s, whose determinant does not 
vanish (this condition being implied by the use just now of the 
adjective distinct). We shall give the w's the values 

08, i = 1, 2, . . . n), 

and hence, from the covariance of these quantities, we shall have 
to give the w's the values A |3l£ . Using a property of determinants 
(given as formula (/?) in the footnote on p. 55), and substituting 
in equation (10), we get 

£ - 4>« = K - K = 0 (a, 0 = 1, 2. . . . n), 

which proves the result required, 

7. Addition of tensors. 

Take two tensors (in general mixed) of the same kmd, i.e. 
having the same number of indices of covariance and the same 
number of indices of contravariance (in particular, two covariant, 
or two contravariant, systems of the same order): 

H - ■ • *w »i . . . t m 

Summing corresponding elements (those with the same indices) 
we get a new system whose general term is 

depending on the same number of indices. We shall show that 
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this new system is also a tensor, covariant and contravariant 
respectively with respect to the indices of covariance and contra- 
variance of the given systems, so that with the notation previously 
adopted the general term can be written 

fih • • • jfi 

To simplify the formula we shall prove the result for the 
case of a single index each of covariance and of contravariance; 
the reasoning is identical in the general case. Our hypothesis 
then is that the forms 



O « 2py B\x,u p 
are invariants. The sum 

F + <I> - Eg (A\ + B$ x L uj = E„ CP Xi u j9 

will therefore also be an invariant, which is as much as to say 
that the system 

is covariant with respect to the lower index, and contravariant 
with respect to the upper. 

The tensor G is called the sum of the two tensors A and B. 



8. Multiplication of tensors. 

We shall now define the product of two tensors. These may be 
of any kind, in general mixed; we shall suppose that one has m 
indices of covariance and /a of contravariance, and the other m' 
and respectively, so that they are represented by 

A jy^ B J y-y. 

Construct the system whose general term is the product of 
any element A by any element B; the element so formed will 
contain m + /z. + m' + indices, so that the rank of the 
'product system will be the sum of the ranks of the given systems. 
We shall show that it is a tensor which has the m + m' indices of 
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covariance of the given system as indices of covariance, and the 
fi + indices of contravariance as indices of contravariance. 
To simplify the formulae we shall as before consider the case of 
only two indices. 

Let the two forms which by hypothesis are invariant be 

f = ik+AiztUk, 

1 

<E> = 2 jk B$Xju' k . 

Their product will also be an invariant, and is 
F® = iupAtB'ziqutUt, 

or, putting 

A1B) = Oft 

FQ> = i^C^XiXj^ul. 
i 

The invariance of this form means that the indices i and j 
attached to the letter C are indices of covariance, and h and k 
are indices of contravariance, which proves the statement just 
made. The argument is the same in the general case. 

9. Contraction of tensors. 

We shall now define the operation of contraction, by which 
we pass from any mixed system to another system having one 
index of covariance and one of contravariance less than the 
first. 

For convenience of printing, we shall give explicitly only 
one of the indices of covariance and one of contravariance, 
replacing the others by points, so that we shall put 

to represent the general term. 
Now construct the system 

i 

which will contain all the indices, except the two shown on the 
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right; we say then that the tensor has been contracted with 
respect to these two indices. We shall show that the system 
so obtained is also a tensor, having the same indices of covariance 
and of contravariance — except of course the pair used in con- 
tracting—as the given tensor. To simplify the formulae, we shall 
as usual consider a particular case, but one not differing essentially 
from the general case. Suppose, therefore, that the form 

F = E ihl8 A^x t x r u h u[ 
i 

is invariant whatever may be the variables x, x\ u, u\ the only 
restriction being that x, x f are point variables and u, u' dual 
variables. Their values being arbitrary, we may replace the 
variables u' by n distinct systems of covariant quantities, which 
we shall denote by A a j w , using the notation (15) of § 6; we can 
then replace the variables x[ by the quantities W a , which are 
the reciprocal elements of the former group, and therefore con- 
travariant (§ 6). We shall thus have the n linear forms 

F a « i^A^^KKiM (a = 1, 2 n) 

i 

all invariant. Their sum G will therefore also be invariant. 
Writing out this sum, and making some slight transformations, 
we get (remembering the fundamental property of reciprocal 
elements) 

G - = l^A^x.u^KK^ = 'i^A'Sx^K- 

Now we know that = 0 if r 4= s and = 1 if r -~ s, hence 
all the terms in the sum for which r =f= « will disappear, and there 
remains 

G = £ lhr A'Zx,u h = £ th x £ u h £ r A% = £ M x f t^Df. 

X 111 

The invar iance of this form shows, as was required, that the 
system n 

J>h y« Ahr 

1 

is a tensor covariant with respect to the i's and contravariant 
with respect to the A\s. 
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The operation of contraction can evidently be repeated several 
times, contracting successively with respect to various pairs of 
indices, so that, for example, from the system 

we can pass, by using two pairs of indices, to the tensor 

If the process is applied to the only pair of indices of a mixed 
double system, the result is an invariant: 

A - hi A). 
i 

10. Composition of tensors. 

if we combine the operation of multiplication of two tensors 
with that of contraction, we get the operation called composition 
(or inner multiplication) of two tensors. We shall write the two 
tensors in the abridged form 

A\ > B ' *> 

where we show only a single index of covariance for one and of 
contravariance for the other. 
The tensor 

c\. - u;; f z?;.; r 

is said to be compounded of the first two or is called their inner 
product; its indices of covariance are those of A, except r, and 
all those of B, and its indices of contravariance are all those of 
A y and those of B, except s. 

It can at/ once be seen that the system C is a tensor, observing 
that it is obtained by contraction with respect to the indices 
r and s from the system 

r;;;* = a\ >r b \ 

which is itself obtained by taking the product of the given 
systems. Thus, for instance, compounding the systems 

Ah T>to 
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with respect to the indices r and s, we get 

u 

Chk y Ah j*kr 

11. Change of variables in general, m-fold systems whose 
elements are functions of position. First general definition of a 
tensor. Typical tensors of rank 1. 

Up to this point we have considered only linear changes of 
variables, and we have defined, with reference to them, covari- 
ance, contravariance, and the fundamental operations on systems. 
We shall now extend these definitions to any change whatever 
of the variables. 

Suppose, therefore, that the formulae of transformation, instead 
of equations (5), are 

*>i — M^i* $2> • • • *n) (i ■"■= 1, 2, . . . n), (17) 

where the //s denote arbitrary functions, except for the quali- 
tative restrictions as to differentiability, &c, which will be 
tacitly imposed whenever necessary, and the condition that the 
transformation is reversible, i.e. Miat the equations (17) are 
soluble for the x's and can therefore also be given in the equivalent 
form 

$i = & # 2 > • - • T n)- • - - (17') 

The general transformation (17) involves a linear transfor- 
mation of the differentials. In fact, putting 

^ = c lh H = c«, . . . (18) 

V x k V x k 

we get, differentiating (17) and (17'), 

" dx- n 
dx, - S A ~J-dx k = Y> k c m dx k \ . . (19) 
1 cx k i 

dXi == E*?^ dx k = J: k c ki dx k (i = 1, 2, . . . n). (19') 

The second of these groups of formulae must be identical with 
the group which would result from solving the first; the quantities 
c H must therefore be the reciprocal elements of the quantities 
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c H$ which justifies the choice of these symbols to represent the 
derivatives. 1 

From the analogy of formulas (19), (19') to (5), (5'), we can 
at once extend the earlier arguments to m-fold systems whose 
elements are any functions of position (i.e. of the independent 
variables x l9 x 2 , . . . x„). We shall say that an m-fold system 
whose elements are functions of position constitutes a tensor, 
co variant, contra variant, or mixed, with respect to a generic 
transformation (17), when it is a tensor of the specified kind 
(at every point of the field considered) with respect to the linear 
transformation (19), (19') between the differentials of the old and 
the new variables. 

In consequence the differentials of the independent variables 
provide us with the typical contravariant simple system. We 
shall next consider what is the typical co variant simple 
system. 

In § 3 we introduced the dual variables u t9 which were formally 
defined as the coefficients of a linear form in the variables x. 
These latter are now to be replaced by their differentials dx, so 
that we start from a generic Pfaffian 

n 

iff £;U<<fc* 
1 

and consider it as invariant for any change whatever of the vari- 
ables x. The coefficients u are considered as functions of position, 
and hence initially of the x's. When the transformation (17) 
is made, the dependence on the point co-ordinates is expressed 
instead in terms of the new variables x. Substituting in tfs for 
dx L from (19), we see in the first place that we still have a Pfaffian 

1 This can also be shown directly, by proving that the terms c kl and c** have 
the fundamental property of reciprocal elements. Jn fact, if in equations (17) 
we replace the x's by the expressions given by equations (17'), they reduce to 
identities. Differentiate one of these with respect to a?*, using the rule for a 
compound function. We shall have 

Now the left-hand side is 0 or 1 according as i =|= & or i 
side we can introduce the notation (18), so getting 

i 

which proves the required result. 

(DCKifi) 



- 1c; on the right-hand 
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in the new variables x; this is obvious, since the original expression 
is linear in dx. Writing out the result, we get 

i i Csc* i ox k i i Co; /f 

The coefficients of the new differentials dx k , i.e. the elements 
u k of the system which is the transform of the coefficients u k , 
are therefore 

n dx 

u k = £„i/ ' (k = 1,2,.,, n). 
i dx k 

Interchanging i and X" and adopting the notation (18), we get 

the law of transformation for the coefficients of a Pfaffian expressed 

by the formula) n 

u t = 2 fr c kt u k9 
1 

which are identical with the formulae (6'). Adding the inverse 
formula) and replacing the coefficients c k( , d k by their values as 
given by (18), we get the transformation formula, for the coefficients 
of a Pfaffian (an invariant) which constitute the typical simple 
covariant system, in the explicit form 

u t = (20) 

i dx h 

u t = E^a!- 1 " <• = 1, 2 n). . (20') 

Suppose in particular that the (invariant) Pfaffian is the exact 
differential of a function u of position; being invariant, u is such 
that its expression in terms of the #\s is obtained from its expres- 
sion in terms of the x'a by substituting f> (£) for x l9 and vice 
versa, so that the formula 

u(x) — u(x) 

is an identity when we substitute in it the expressions given by 
(17) (or (1 7') ) for the x'a (or the x'a). 

S u 

The coefficients u l9 u { of the Pfaffian are respectively or 

du 1 
according as du is considered as expressed in terms of the x'a 

d x t 

or of the x'a. 
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It follows that the derivatives of an invariant are transformed 
by covariance, the law being given by formulw (20), (20'). 

Vice versa, to obtain the formulae of covariance (20) or (20') 
relative to a simple system, without having to go through all 
the steps from the beginning or to remember them by heart, 
the easiest memoria teehnica is to consider the elements of the 
generic system in question as being for the moment the deri- 
vatives of a single function, and to apply the rule for differentiating 
a function of one or more functions. We then automatically get 
formulae (20) or (20') according as we start from an original or 
a transformed element. 

The direct transformation of the differentials further, as we 
have seen, gives formula? (19) and (19'), which we can use as 
the transformation formulae for a generic contravariant simple 
system, by substituting the original elements £ ' for the differentials 
dx, and the transformed elements £ k for the differentials dx k . 

To sum up, the differentials of the independent variables and 
the derivatives of a single function give what we may call the 
pattern of the transformation formula) for simple contravariant 
and covariant systems respectively. 

12. Second general definition of tensors whose elements are 
functions of position. Examples. 

Take a multilinear form in any number of sets of contravari- 
ant variables (i.e. having the same law of transformation as the 
dx t ) and in any number of sets of covariant variables (i.e. having 

c) v \ 

the same law of transformation as the u t ■= 1. Let the co- 

dxj 

efficients be considered as functions of position, and the given 
form as invariant at each separate point. From the definition 
given in the preceding section it is clear that the coefficients 
form a mixed tensor, whose indices of covariance are those relative 
to the contravariant variables, and vice versa. Reciprocally, 
every tensor, in the sense of the first definition, can be identified 
with the coefficients of a multilinear form of the kind just 
described. The two definitions are therefore completely equi- 
valent. 

From this point everything is analogous to what was said in 
§ 4, and we may therefore dispense with further details, except 
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to repeat once more explicitly the remark made at the end of 
§ 4 as to the vanishing of a tensor (i.e. of all its elements) being 
an invariant property. The property holds in general for any 
change of variables of any kind. In other words, if all the elements 
of a generic tensor 

*1 ** • • ■ * V 

referred to a particular system of variables, are zero, we may 
be sure that the equations 

continue to hold however the variables may be changed. 

We shall close this section with two examples of tensors 
which occur fairly often. 

Consider first a linear operator A y where 

i ox, 

whose coefficients A 1 are specified functions of position. Let us 
treat the operator as an invariant. Then since the terms 

are covariant, it follows that the A^s are by definition contra- 
variant, and must therefore have their law of transformation 
given by the equations (19'), so that we get for the transformed 
coefficients the expressions 

A} = i k A*l*\ 
i dx k 

as could easily be verified directly. 

Consider next a differential quadratic form 

n 

<f> = ^ik^ikdxidx,,, 
i 

which is to be invariant; the coefficients a ik (in general to be con- 
sidered as functions of position) will then be covariant, and hence 
their transformation formulae will be 

S_ 3 x r ^ Xg » 
r* a r«* 5 — > ? • • • 

I OXi cx k 
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or (solving for the transformed elements) 

a* = s„«4* r I? (21 '> 

1 ox i ox k 

13. More complex laws of transformation. Scope of the 
Absolute Differential Calculus. 

In a generic change of variables a system, as we have said, 
is transformed in a way which depends on its definition. The 
cases so far examined have been the simplest, but others of 
considerably greater complexity may also occur; we shall now 
give an example of these. 

We have seen that the simple system composed of the first 
derivatives of an invariant function u is covariant; we now 
proceed to examine the double system of the first derivatives 

* of a covariant simple system u t . As a particular case, if the 
w/s are the derivatives - — of a single function u, we cover the 

O X; 

case of the transformation of the second derivatives of an in- 
variant function. 

To find the transformation formulae for this system, i.e. the 

relation between the terms and the terms we start from 
the transformation formula for the u^s: 

1 VXt 

Differentiating it with respect to x Jy and considering the w/s 
on the right as functions of the x'b and therefore of the £'s, we 
get 

P* = 3u * X * |S p + i J* x * u k . . (22) 
oxj 1 ox t OXj ox A 1 dx t dXj 

Tf the last sum were absent, the law of transformation would 
be that of covariance. But in fact the presence of the second 
derivatives of the x's with respect to the x's shows that the 
system we are examining is neither invariant, nor covariant, nor 
contravariant, nor mixed, and therefore is not a tensor, its law 
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of transformation is more complicated than any we have yet 
examined. A similar result is true more generally for the system 
composed of the derivatives of any tensor. 

It is often necessary to consider the derivatives with respect 
to the independent variables of the elements of a tensor, eovariant, 
contra variant, or mixed. In order to avoid the complication 
just observed, it is therefore convenient to replace these deriva- 
tives by linear combinations of them with the elements of the 
tensor, so chosen that those terms which lead to the aforesaid 
complication disappear in the transformation formulae. This is 
the problem which the Absolute Differential Calculus proposes 
to solve; it does so, as we shall see farther on, by introducing 
an auxiliary element, namely, an invariant differential quadratic 
form. We shall therefore devote the next chapter to the study 
of this important element. 



CHAPTER V 

Geometrical Introduction to the Theory op 
Differential Quadratic Forms 

(a) The Line Element on a Surface 
1. Parametric equations of a surface. 

The meaning of the term " parametric equations of a sur- 
face" is known from analytical geometry. We propose, however, 
here to examine the idea from the beginning, in order to find the 
formulae in the shape which is best suited to our purpose. 

We shall use the letters y l9 y 2 , y 3 throughout this chapter 
to represent the Cartesian co-ordinates of the points of space 
referred to three orthogonal axes. Now consider a surface, or 
more generally a piece of a surface a, to which alone the following 
remarks are understood to apply, and suppose that there has 
been established, in any way whatever, a one-to-one correspon- 
dence between the points of a and the pairs of values which can 
be assigned to two parameters x v x 2 within a certain field C 
of a plane representative of the arguments x x , x 2 (cf. the general 
remarks in Chapter I, § 1). 
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This implies that the points of a and with them their Cartesian 
co-ordinates y v are definite (and finite) functions of x v x 2 in the 
field C. We shall accordingly write 

y v = yA*!,**) = 1,2,3),. . . (1) 

where for subsequent purposes the three functions y v must have 
derivatives, to any order which we may have occasion to consider, 
which are continuous in the field C. 

But this behaviour of the functions is not in itself sufficient 
to ensure that the equations (1) do effectively define a surface, 
i.e. that the supposed one-to-one correspondence does in fact 
exist between C and the points of a portion of a two-dimensional 
manifold. 

It might for instance happen that only the sum x x + x 2 
appeared in the equations (1), in which case the dependence on 
two parameters would be only apparent, only one of them being 
essential. In this case the equations (1) would define a piece 
of a curve. To exclude the possibility of anything of this kind 
we shall suppose that two of the equations (1) are soluble (within 
G) for x l9 x 2 , so that by solving them, and substituting the 
values so found in the remaining equation, we can get one (and 
only one) relation between y l9 y 2> y s , i.e. the equation of a surface. 

This is equivalent to imposing the condition that the char- 
acteristic of the functional matrix 1 of the equations (1) is 2. 
Then the equations (1) will actually represent the parametric 
equations of a piece of a surface or; and it could be shown that 
— with the restriction, if necessary, of the field C to a convenient 
portion T of itself (around an arbitrarily chosen point)— the 
portion of surface so defined is such that to any point on it there 
corresponds one and only one set of values of the parameters 
in the field I\ Accordingly, with this qualitative restriction as 
to the field — which we shall always consider as being of the 
type r — in which the parameters x^ 9 x 2 are made to vary, we 
are quite justified in calling x l9 x 2 curvilinear co-ordinates on 
the surface a defined by equations (1). 

Giving x x a constant value, and making x 2 vary, we get all 
the points of a line, which we shall call the line x x = constant, 
or the line x 2 , or more shortly, the line 2 (since only x 2 varies along 



1 See 6, 7, pp. 8 12. 



88 



INTRODUCTORY THEORIES 



it); in the same way we can define the lines cc 2 = constant, or 
the lines x l9 or merely the lines 1, as those along which only x x 
varies. We can thus think of our surface (or portion of surface) 
a as covered by a double network of lines (co-ordinate lines) 
such that two and only two— one line x x and one line x 2 — pass 
through every point of it, 

2. Expression for ds 2 . 

We shall now fix two infinitely near points, P, P', on a; 
let their curvilinear co-ordinates be 

x l9 x t -f dx, (i = 1, 2), 

and, subject to the equations (1), let 

y v , y v + dy v (v - 1, 2, 3) 

be their Cartesian co-ordinates. 

Note that in order to specify a point P on a, we may take 
arbitrarily (within T) the two co-ordinates x 1 , x 2 ; and so also, 
in order to reach P', the two increments dx v dx 2 . 

The y's are defined by the equations (1), so that their differ- 
entials are connected with the dx's by the equations 

dy v - iil y 'dx, (v = 1,2,3), . . (2) 
i dx l 

which are obtained by differentiating the equations (1). 

We shall calculate the distance PP' — ds, or rather, as 
being more direct, its square 

ds* S,<iy r ». 
i 

Substituting the expressions (2) for the dy 9 s, we shall have 

i i dXi dx k i i uXi ox h 

from which, putting 

n _ V d v* d v* <*\ 

Q'lk — - - W) 

i o%i vx k 
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(by which we define a very important symmetrical double system 
of regular functions of the x's), we get 

2 

ds 2 = H ik a £k dx i dx k (4) 

This quadratic form, which, as we shall see, is fundamental 
for the study of the metrical properties of our surface, is an 
obvious generalization of the expression 

ds* = dx* + dy* 

which in Cartesian co-ordinates gives the distance between two 
infinitely near points of a plane. 

We shall now show that the form (4) is definite and positive 
i.e. that it never becomes either zero or negative, whatever 
values (real and not zero) are assigned to the dx'a. That it 
cannot be negative is at once seen from the fact that it is 
the sum of the squares of the dy's, which are always real if 
the rf^'s are real. It could therefore vanish only if all the dy's 
vanished, and we shall show that this is impossible for any 
actual displacement (one in which dx x and dx 2 are not both 
zero). 

In fact, let us try to suppose that we can have 
dy } ^ dy 2 =■-■ dy 3 « 0. 

Using equations (2), these become three linear homogeneous 
equations in dx v dx 2 . In order that any two of these may be 
satisfied by non-zero values of these variables, the corresponding 
determinant must vanish; since we may choose arbitrarily the 
pair of equations to be satisfied, we conclude that all three 
of the functional determinants (of the second order) of the 
y's with respect to the cc's must vanish, which contradicts 
the hypothesis that the characteristic of the functional matrix 
is 2. 

The general theorem relating to simultaneous linear homo- 
geneous equations could also be applied directly; namely, that 
the number of independent solutions is the difference between 
the number of the unknowns and the characteristic of the matrix 
of the coefficients — in our case 2 — 2, or 0. 

From the proof that the quadratic form under discussion is 
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definite, it follows, by a known theorem 1 on quadratic forms, 
that the determinant 



a = 



<*>ik 



composed of the coefficients of ds 2 (called the discriminant of 
the form) is not zero; in particular, when, as in the present case, 
the form is positive as well as definite, we have specifically a > 0. 

The fundamental form (4) calls for one last remark, almost 
obvious but important. This is that the system of the coefficients 
a^ is covariant with respect to any transformations whatever of the 
variables x v x 2 (which justifies our having placed the indices 
i 9 k below). This covariance follows directly (applying a remark 
made at the foot of p. 84) from the invariance of the quadratic 
form ds 2 . 



3. Determination of the directions drawn from a generic 
point. 

In the space y x , y 2 , y 3 , a direction drawn from a generic point 
P may be considered as determined by an infinitesimal segment 

1 The theorem referred to is as follows. Let 

n 
i 

be a definite quadratic form in n variables; we shall show that its discriminant a 
cannot be zero. 

In fact, putting n 

y % = 2 k a lk x k {i = 1, 2, . . . n), 
i 

n 

we get 0 = i] v // t as£. 

l 

Now if a = 0, we could make 0 — 0 without all the x's being" zero (contrary 
to the hypothesis that the form is definite) ; we should only have to make all the 
y*s zero, by solving the n linear homogeneous equations 
n 

^ = 0 (» = ], 2, . . . n), 

l 

which would lie soluble, giving values for the ar/n which are not all zero, provided 
a = 0. 

For definite positive forms a is therefore also necessarily positive. One way of 
seeing this is to apply one of the infinite number of (real) linear substitutions 
which reduce 0 to the canonical form (see e.g. Bianchi: Lezioni di geamctria 
analitioa, Appendix, pp. 671-592 ; Pisa, Spoerri, 19'20). It is obvious that a 
positive form which contains only squares of the variables has its discriminant 
a > 0. But the original a and a are connected by the relation a = «A 2 , where 
A denotes the determinant of the linear substitution. (See p. 157.) We 
therefore necessarily also have a > 0. Q.E.1X 
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having one end at P, or if preferred by another point P' infinitely 
near P, or, which comes to the same thing, by an infinitesimal 
displacement of P. 

Now suppose that P belongs to <r, and consider the directions 
drawn from P which are tangent to the surface. To determine 
them we have to take points P', infinitely near P and belonging 
to a. If therefore we call the surface co-ordinates of P x ± and 
x 2 , we can determine P' by the surface co-ordinates x x -f- dx v 
x 2 + dx 2 . 

Thus to each pair of infinitesimals dx l5 dx 2 , there corresponds 
one and only one tangential direction drawn from P. To one direc- 
tion, on the other hand, there correspond an infinite number 
of pairs of differentials which differ from each other by a (positive) 
factor, since the length ds of the segment PP' chosen to deter- 
mine the direction is a priori arbitrary, the only condition being 
that it is infinitesimal. 

In order to make the correspondence one-to-one, we shall, 
in order to determine a direction, replace the differentials by the 
proportional quantities 

\i . _ \2 — dx 2 

ds ds 

these are unchanged if we multiply dx x and dx 2 by a positive 
factor h, since it follows from equation (4) that then ds is also 
multiplied by k. 

These quantities are called parameters of the direction and 
obviously reduce to direction cosines when the surface cr is a 
plane and x v x 2 represent orthogonal Cartesian co-ordinates. 
The parameters are not independent but are connected by the 
relation 

% J( a lk XX k - 1, .... (5) 
1 

which is obtained by dividing equation (4) by ds~ and which 
corresponds to the well-known identity for the Euclidean plane 
that the sum of the squares of the cosines 1. Since ds is an 
invariant and the dx'a are contravariants, the parameters are 
also contravariants, which justifies our having placed the indices 
above. 
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Instead of the parameters two linear combinations of them 
are sometimes used; these are 

A; - S*a*A fr (i = 1, 2), . . . (6) 

1 

which are called moments. Since the coefficients form a co- 
variant double system (cf. § 2), and the parameters, as we have 
just shown, form a contra variant simple system, it follows that 
the moments are covariants. 1 

We showed in § 2 that the determinant a is not zero; the equa- 
tions (6) can therefore be solved, giving the formulae 

A f - S fr «'*A t) (6') 

1 

which give the parameters in terms of the moments. The para- 
meters and moments are connected by a particularly simple and 
remarkable bilinear relation, which follows immediately from 
(5) and (6). In fact, multiplying the equation (0) for the generic 
index i by A', and summing for % =-- 1 and i — 2, we get from (5) 

2 A t A' — 1 (C) 

i 

It follows directly that the moments also are connected by a 
quadratic relation. We need only substitute in (5') for A' the ex- 
pression given by formula ((V), which gives at once 

L«'*AA - 1 (5") 

1 

4. Angle between two directions. Contravariance of the 
coefficients a! k . 

Consider two directions on a surface drawn from a single 
point P. We shall denote them by X and (Jt, where these two 
symbols mean more precisely the two unit vectors which deter- 
mine the given directions. The parameters and the moments 
of X will be denoted by A*, A,-, and those of jx by fi\ f/,,, respectively. 
We propose to find the angle & between the two directions as a 
function of the parameters or of the moments. 

Denoting the increments of the co-ordinates y„ 9 x L by dy ¥ , 



1 Cf. § 10, p. 79. 
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dxi respectively, the direction cosines of X, for a displacement 
ds along X, will be 

as 1 ox i ds 1 ox l 

Similarly, denoting by the symbol 8 the increments of the co- 
ordinates for a displacement 8s along jjl, we have for the direction 
cosines of this direction 

Hence, from the usual formula) of analytical geometry, we 

get 

cos» - s »*■ = ii ^ av = Lwkl 9 - 

i r/.sr bs 1 1 tfX; da^ 1 1 ox t ox k 

and therefore finally 

cosft = 2^a, 7 .AV (8) 

1 

Substituting for /x /f , or A 1 ", or both, their expressions in terms 
of the moments, we get for cos& the following equivalent expres- 
sions: 

o 

COS* t, X'fJL,, (8') 

cos* = 2, A, (8") 

1 

o 

cos* ^ 2,*a' /r A, /z. (8"') 

1 

The last of these formulae enables us to see that the notation 
cH k is in agreement not only with the convention adopted for 
reciprocal elements, but also with that of waiting the indices of 
contravariance above. For putting 

u t s= X t ds, v k = fi A 8s (i, k = 1, 2), 
where we note that the u 9 s and the v'b are independent variables 
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(not connected by any relation as are the A's and the yx's), we can 
write equation (8'") in the form 

2 

dsSs cosft = S tA . a ik UiV k \ 
i 

since the left-hand side is invariant, and the right-hand side con- 
sists of a bilinear form in two sets of arbitrary covariant variables, 
it follows that the coefficients a ik are contravariant. 

To find sinft, we can form the product by rows of the two 
determinants 



A 1 


A 2 




\ 


A 2 






X 




M 1 


f* 2 




Mi 


M 2 



Applying formulae (5'), (8'), and (8"), this becomes 
1 cosft I 



cosft 1 
We therefore have 

shift 



1 — cos 2 ft -— sin 2 ft. 



x 



A x A 3 
Mi M 2 



(9) 



where the radical must have the sign +, since by definition the 
angle ft between two directions always <T tt, and therefore sin ft 
> 0. 

The expression (0) can be put in another form. It is easy to 
verify that 

X x A2 «n a 12 

and therefore 

I Ai A2 

■ O') 
. . . (9") 



i A1 


A 2 


= a 


A 1 


A 2 


' iM 1 


M 2 




M 1 


M 2 



or also sinft = 





A* 


A 2 


*J a 




M 2 


M 1 


1 


Ai 




s/ a 


Mi 


f*2 



where in each case it is understood that the radical \/ a is to have 
its arithmetical value. 
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5. Associated, and in particular reciprocal, tensors. The 
typical example of the parameters and moments of a single direc- 
tion. 

Take a generic tensor (with reference to the variables x v x 2 ) 

*i hi • • • *<« 

of rank m + l*\ if we compound it with the coefficients a ik 
of our expression ds 2 , we can transfer any one of the indices h, 
say h l9 from above to below, so getting 

*> 

jJl<>. .hp _ y A*h % . .hp 

which is a tensor of the same rank, but with an index of covari- 
ance more and an index of contravariance less, namely h v Simi- 
larly, compounding with the contravariant double system com- 
posed of the reciprocal elements a ,k (cf. § 4), we can transfer 
any one of the indices of covariance, say i lt from below to above. 
We need only put 

2 

*i ■ • i,n * As • • 

in which the system C is also a tensor of the same rank. 

These operations can obviously be repeated, so as to transfer 
not one but several or even all of the indices of the given tensor. 
All the tensors so obtained are called associated tensors of the 
tensor A * 2 so that association is a relation which is 

dependent on a given ds 1 . In particular the tensor 

^% h , . . . A„ 



2 

= S, , k k ,, a liji a 1 ^ . . . a 1 '"-'" 1 a A , r a* , f ... a h Tf A* 1 * 1 ' 



in which the indices of covariance are the same as the indices of 
contravariance in A, and vice versa, is said to be reciprocal to 
A, the use of the term being justified by the consideration that 
the relation is reversible, A being the reciprocal of Z in the same 
sense as Z is of A. This can be shown explicitly if we suppose 
the above formulae denning the system Z solved for the A's. 
Equations (6) and (6') show that the parameters and moments 
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of a single direction form a particidarly simple and striking 
example of a pair of reciprocal systems. 

Remark /.—The definition of the tensors associated with a 
generic tensor A involves essentially a specific ds 2 , whose co- 
efficients a ik and their reciprocals al k form part of the definition. 
When it is necessary to emphasize this fact, we can do so by 
speaking of the tensor or tensors as associated with respect to the 
ds 2 in question. 

Remark //.—For the symmetrical co variant double system 
a lk and the contravariant system composed of the reciprocal 
elements a tIl \ from which the associated tensors are constructed 
by composition, we coxdd plainly take the coefficients of any 
other invariant quadratic instead of those of ds 2 (provided 
only that <f> is irreducible, so that the reciprocal does in fact 
exist). We should then have associated systems with respect 
to the quadric cf>. 

Remark III. — We may point out at this stage that the 
idea of associated systems holds good as it stands for any 
number of variables x v x 2 , . , . x tl . We need only suppose that 
the indices take the values 1, 2, ... n, and that the auxiliary 
element is represented by an irreducible differential quadratic 

a 

form <f> ~ a ik dx t dx f , in n instead of in two variables. 
i 

6. Surface vectors. 

Let R be a non-zero vector drawn from a point P of the surface 
a, tangentiaily to the surface; we shall call it a surface or tan- 
gential vector, and we can determine it by its Cartesian components 
F„ (v = 1, 2, 3) or, in closer agreement with its intimate relation- 
ship to the surface, by its magnitude R and its direction, the 
latter being determined by its parameters A' or its moments 
A;. These three quantities are not independent, since the para- 
meters (or moments) are connected by the usual identity; the 
vector is therefore determined by two essential quantities. It 
will accordingly be convenient to represent it by the two inde- 
pendent quantities 

R* = R\* (i = 1, 2), . . . (10) 
or alternatively by the pair 

Bi - R\ {i - 1, 2), . . . (10') 
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which are called respectively the contravariant and comriant 
components of the vector. 

These obviously form a pair of reciprocal systems, since by 
the preceding section the parameters and moments are reciprocal, 
and equations (10) and (10') show that R ! and R, differ from the 
parameters and moments only by a common factor R. 

R can be calculated from them by means of the identities 

^a+B'R* - ft 2 , ... (11) 
k k a ik R t R k = ft 2 , ... (11') 

i 

itR.R* -* R 2 (II") 

1 

(which are merely (5), (5'), and (5"), each multiplied by ft 2 ), 
and then A' and A, follow from equations (10) and (10'); thus 
we see that the vector is completely determined by its contra- 
variant (or by its covariant) components. 

To find the relation between the contravariant components 
and the components Y t . with respect to Cartesian axes y ls y 2 , 

note that the direction cosines of the direction whose para- 
meters are the A's are given by equation (7), and hence the 
components Y v (which are equal to these cosines each multiplied 
by R) are given by the equation 

Y p = i&'R? (12) 

1 QXt, 

It is now obvious that the covariant components can ba 
obtained from the contravariant components, and vice versa, 
by means of formulae completely analogous to (6) and (6'), and 
obtained from these by multiplying them by R f 

If we have to deal with zero vectors, i.e. having their length 
R zero and their direction indeterminate, we find that in order 
to satisfy equations (10) and (10') in this limiting case we have 
to take R l — 0, R t ~- 0. With these values all the other 
equations ((11), (11')? &c.) are also satisfied, as can at once 
be seen from the fact that both sides of each equation vanish 
separately. 

By an analogous procedure we can find simple expressions 
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for the scalar product R X V of two surface vectors R, V, re- 
membering that if ft is the angle between the vectors we have 

R X V — RVcosfr (13) 

In fact, considering first the general case of two vectors neither 
of which is zero and whose versors are X and (jl respectively, and 
multiplying equation (8') by RV 9 we get 

RXV=&,&V„ . . . . (14) 
i 

while the equations (8), (8"), and (8'") would give analogous 
formulas. 

The expression (14) for the scalar product also holds, like 
formula (13), when one or both vectors are zero, the scalar pro- 
duct (by definition) and the right-hand side being then zero in 
both formulae. 

7. Parameters and moments of the co-ordinate lines. Element 
of area. 

We shall next obtain the direction parameters of a co-ordinate 
line, e.g. the line a?j (i.e. x 2 = constant), considered in the 
direction of x increasing. For an infinitesimal displacement in 
this direction, we have 

dx 2 — 0, da 2 a n dx* + 2« 12 dx x dx 2 + a 22 dx. 2 2 = a n dx*. 

Since ds is essentially positive, and dx 2 is positive by hypo- 
thesis, we have, extracting the square root of the last of these 
formula?, 

ds — >/ a n dx l9 
where the radical is taken positively. It follows that 

A 1 - *p - - * A 2 - d ?* - 0. . (15) 

«* <a n ds 

Similarly, the parameters of the line 2, in the direction of 
Xz increasing, will be 

I* 1 = 0, i* = * . . . ( 15') 
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Substituting these expressions in the equations (8) and (9'), 
we can get the angle Q between the two co-ordinate directions. 
The resulting formulae are 

cosQ = ^ , . . . . (16) 
\f a 11 a 22 

sinQ = ^ a (16') 

Equation (16) shows that the necessary and sufficient condi- 
tion that the co-ordinates x Jy x 2 may be orthogonal is a l2 = 0. 

If we take an infinitesimal element of surface, obtained by 
drawing two infinitesimal segments ds, S#, from a point P along 
the co-ordinate Hues, and completing the parallelogram, the 
area of this element will be 

da dsSs sin £2 

~ V a n dx x . s/ a 22 dx 2 . - = *J a dx ± dx 2 . (17) 

v a u a 22 

8. Fundamental observation (Gauss's) on the intrinsic geo- 
metry of a surface. 

We are now in a position to make an observation which will 
show fully the importance of the quadratic form (4) in the study 
of the surface. For this purpose we shall first make use of certain 
intuitive considerations in order to fix the idea of the intrinsic 
geometry of a surface. 

Let us give the concept of a surface a material form by think- 
ing of a flexible and inextensible sheet of matter on which figures 
can be drawn, and such that it can be deformed, bent, and folded 
in an infinite number of ways, but not torn or stretched. When 
a surface of this kind is deformed the figures drawn on it will 
take different spatial configurations, but some of their properties 
will be invariant. For instance, if two lines intersect, they retain 
this property however the sheet is deformed; the length of a 
segment of a line remains the same, and hence the distance 
between two points, measured along the surface (i.e. along the 
shortest line joining them which lies wholly on the surface), 
is unchanged; the angle between two lines which meet at a point 
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is unchanged; and so on. In short, all those properties which 
involve no element alien to the surface (or, as it is usually ex- 
pressed, which can be investigated without leaving the surface) 
are independent of the deformations of the surface, and con- 
stitute its intrinsic geometry. 

Even in elementary geometry we have examples of this 
kind. Plane geometry can be, and most of it is, constructed 
without using points outside the plane, and is therefore intrinsic 
as regards its plane; it still holds— at least for suitably restricted 
regions— if the plane is folded, or wrapped round a cone or a 
cylinder. 

Now consider the fact that the fundamental elements for the 
study of the metrical properties of a figure are: (a) the distance 
between two infinitely near points, and (b) the angle between 
two directions. In fact, the length of any line whatever is found 
by integration from the length of its infinitesimal elements, the 
area of a figure can be calculated by breaking it up into elemen- 
tary parallelograms, and so on. Now the formula) (4) and (8) 
(or (8'), &c.) provide us with precisely these two fundamental 
elements for the study of the intrinsic geometry of a surface, 
whenever the coefficients of ds 2 are known as functions of the 
#'s; these coefficients therefore determine the metrical (intrinsic) 
properties of the surface, and are invariant for any deformation 
whatever of the surface which does not involve stretching. Hence 
the particular interest of all those theorems which can be expressed 
analytically in terms only of the surface co-ordinates x and the 
coefficients a ik of the fundamental form; namely, the fact that 
they express properties belonging to the intrinsic geometry of 
the surface. The introduction into mathematics of this idea, and 
the fundamental observation relating to it, are due to Karl 
Friedrich Gauss. 

9. Note on developable surfaces. 

A developable surface is one which is flexible and inextensible 
and can be made to coincide with a region of a plane, without 
tearing or overlapping. Examples are the cylinder and the cone, 
and any surface formed of several portions of a plane. The in- 
trinsic geometry of surfaces of this kind, as we have seen in the 
preceding section, is identical with that of the plane, and their 
line element can take the same forms as that of the plane; e.g. 
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we can choose a system of surface co-ordinates a^, cCg, such that 
ds 2 = dx t * + dx 2 *. 

Consider a simple infinity of planes, which we may think of 
as represented by a linear equation in the Cartesian co-ordinates 
y l9 y 2 , y& whose coefficients are continuous functions of a parameter 
u. The envelope of this family of planes is a developable surface 
to which they are tangent planes. This proposition is rendered 
intuitive by the following argument based on infinitesimals. 

Let m x , T3 2J to 3 , ... be planes of the family corresponding to 
successive infinitesimal increments of the parameter u; and let 
g x be the intersection of w x and m 2 , 
g 2 the intersection of w 2 and ru a , 
and so on. By definition, the 
geometrical locus of all these lines 
is the envelope surface. The lines 
g l9 g 2 , . . . are called its charac- 
teristics or generators; each of the 
planes to contains two of them, 
forming an infinitesimal angle (cf. 
fig. 1), and the envelope may be 
considered as made up of an in- 
finite number of these infinitesimal 
plane regions. It is thus clear Fig. i 

that the envelope surface can be 

developed into a plane by successive rotations about the 
generators g v g 2 . . . . 

We shall shortly have occasion to consider the envelope of 
a particular family of planes (depending on a single parameter), 
namely, the tangent planes to any surface whatever <r, at all 
points of a specified line T lying on the surface. The envelope 
of these planes is a developable surface cr y ,, which is called the 
developable circumscribed to a along T; since the tangent planes to 
<r at points on T are also tangent planes to cr. n it follows that the 
circumscribed developable touches a along the line T. 

(b) Parallelism with respect to a Surface 
10. Geometrical definition. 

In Euclidean plane geometry, when two points P, P v are 
fixed, then to every direction drawn from P there corresponds 
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one and only one direction drawn from P 1 and parallel to the 
first. We now propose to extend this idea from the intrinsic 
geometry of the plane to that of any surface a whatever. 

For this purpose consider a point P of a, the corresponding 
tangent plane to, and a generic direction drawn from P tangenti- 
ally to or and therefore lying in to. We shall consider the direction 
as determined by the corresponding versor (unit vector) u, and 
shall accordingly refer merely to the direction u instead of the 
direction whose versor is u. Let P x be any other point on cr, 
and m 1 the tangent plane at P v 

If the surface a is developable, we can obviously establish a 
correspondence, which we shall call parallelism, between the 
directions drawn tangentially from P and those from P 1& The 
direction u x which becomes parallel to u in the ordinary sense 
when a is developed upon a plane will be called parallel to u with 
respect to the surface. 

This criterion fails in the case of a non-developable surface 
cr (even of the most elementary type, such as a sphere), and it 
is natural to look for an adequate generalization of it. The most 
direct solution is obtained by adding to the elements of position 
already considered (which are sufficient without further definition 
for developable surfaces) a connecting law, a priori arbitrary, 
according to which P 1 is to be considered as reached from P by 
moving along a specified curve T lying on a (the curve of dis- 
placement). 

We can now, with reference to this curve T, define parallel 
displacement from P to P x as follows. Consider the developable 
circumscribed to cr along T; this surface, which we shall call 
cr 7 , is, as we know, tangential to a along the given curve, and in 
particular at P and P v Hence the directions tangential to cr 
at these two points are also tangential to cr T . We can now take 
for our definition of surface parallelism on cr along T the paral- 
lelism which we have associated with the developable cr T , and 
we shall agree to say that the parallel at P 1 along the line T 
to a generic direction (in the surface) u at P is the direction 
(in the surface) which, on the developable cr T , is parallel to 
u in the sense just defined. 1 

1 A »i id pie and ao to Apeak automatic way of constructing parallel directions is 
to roll the surface cr alon^ a plane. Cf. Pkukmjo : '* Realizssazione cine mat ica del 
parallelismo auperficiale '\ in Rend, della R. Aec. dei Lincti, Vol. XXX (2nd 
half-year, 1921), pp. 127-128. 
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11. First consequences. Equipollence of vectors with respect 
to a surface. 

A necessary consequence of the foregoing definition is that 
—contrary to what happens for developables— the direction u x 
which is parallel with respect to the surface to the direction 
u at P, is not uniquely determined by P, u, P l9 alone, but in 
general depends also on the curve of displacement,. From this 
point of view the geometrical concept of parallelism can be 
compared with the physical concept of work, which involves 
the integral of an expression of the form X x dx x + X 2 dx 2 (where 
x y , x 2 are co-ordinates, of any kind, of the points of a). This 
integral in general depends on the line T of integration; only 
in the particular case when X x dx x -|- X 2 dx 2 is a perfect differ- 
ential is there no such dependence. 

Returning to parallelism along 7\ we must first point out that 
angles are unchanged by parallel displacement. That is to say, 
if a, b are two generic directions (in the surface) at P, their parallels 
at P 1 with respect to the surface, a 1? b l9 contain the same angle. 
This is obvious if we notice that we have parallelism in the 
ordinary sense in the plane upon which a T is developed, and that 
further the operation of development does not change angles. 

Up to this point of the discussion we have referred solely to 
directions, with their corresponding versors. It is clear that the 
same 4 , construction as that used to pass from u to u x can be applied 
to a tangential vector R of any (non-unit) length R. If u is the 
corresponding versor, we have R — Itu, from which we get a 
vector R x - 7iu x , i.e. a vector localized at P x , having the same 
length as R and the same direction as the versor u x which is 
parallel to u with respect to the surface. We shall naturally 
say that the vectors R and R t are equipollent with respect to 
the surface, with reference to the path T. In substance this 
concept of equipollence with respect to a surface reduces at once 
to parallelism, two tangential vectors being equipollent when 
they are parallel and have the same length. 

The case where the curve of displacement T is a geodesic 1 

1 I.e., with the usual definition, iiny line on o such that at every point its 
osculating plant* is perpendicular to the tangent piano to a. The linos which give 
the shortest path lying on the surface l>etweeii two given points always have this 
property. Further, the reciprocal theorem is nlso true (under certain restrictions); 
hence to define geodesies we can use sometimes one and sometimes the other 
criterion. We shall return to the question farther on (cf. p. 130). 
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on a calls for a special note in relation to parallelism. T is then 
also a geodesic on a, r In order to see that this is so, note that 
a and a. t have the same tangent planes at all points of T\ henoe, 
if the various osculating planes of T are normal to one of the 
surfaces, they are also normal to the other. When a T is developed 
on a plane, the geodesic T becomes a straight line (an immediate 
consequence of its characteristic property of giving the shortest 
path between any two points on it), and the directions u and u lf 
which become parallel in the plane as a result of the development, 
will make equal angles with this line. Since development does 
not change angles, we deduce that parallel directions on a at 
points of a geodesic make equal angles with this geodesic, 1 In 
particular, if u coincides with the direction of T at P, then 
u x will coincide with the direction of T at P l5 or in other words, 
the directions of a geodesic at its various points are all parallel 
(along the geodesic itself); more shortly, the geodesies are auto- 
parallel curves. It follows from these arguments that auto- 
parallelism is a characteristic property of geodesies and can be 
used to define them. 2 

12. Infinitesimal displacement. Infinitesimal form of the law 
of parallelism. 

Suppose in particular that P± is infinitely near to P, so that 
the path T is reduced to the elementary arc wliich is uniquely 
determined (except for infinitesimals of order higher than the 
first) by its extremities. For the development in this case we 
need only give the plane m l an elementary rotation round the 
straight line r in which it intersects w. Incidentally we may 
note that the direction of this line is said to be conjugate to the 
direction PP l9 at P or at P x (both points giving the same result, 
except for infinitesimals). We shall denote by — co the infini- 
tesimal vector, parallel to r, which in magnitude, direction, and 

1 Taking this property as defining parallelism with respect to a surface we can 
deduce from it for the sphere an elegant geometrieo-kinematical construction from 
which various other properties follow easily. Cf> G. Cohbklmki : "Genesi 
cinematic* intrinseca del parallelisms di Tjevi-Civita in Rend. deUa R. Ave. dci 
Linceu Vol. XXXII (1st half-year. 192'i), pp. 72 -76. 

a This statement will he recognized its an obvious extension to surfaces of any 
kind whatever of the primary intuition of the nature of the straight line, expressed 
by Euclid in the words eiOeta ypafJLjUL-fy eVrtf, fjrtr e'f tcrov rots ^0' Iclvttjs tnjficlots 
tcciTai (a straight line is that which lies equally with respect to all its points). 
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sense represents the elementary rotation by means of which w A 
is brought into coincidence with to. Then to will be the elementary 
rotation which will bring w 1 back from the plane of development 
to to its original position as tangent plane at P v Let R be a generic 
tangential vector drawn from P\ in order to find the equipollent 
vector R x at P v we draw from P l5 when in the plane of develop* 
ment, the vector equipollent in the ordinary sense to R, and then 
bring the plane w 1 back to its original position, carrying with it 
the vector so constructed. Thus the vector R x is merely R, 
after having undergone a displacement (of no interest if we 
consider the vector independently of its point of application) 
and also the rotation c*>. From the elementary principles of 
rigid dynamics we find that the difference between the vectors 
R x and R, i.e. the vectorial increment dR of the vector R during 
the parallel displacement from P to P lf is given by 

dR — to A R, 

i.e. the vector product <oR. 

As both co and R are vectors in the plane to it follows that the 
increment dR is perpendicular to this plane, or, in particular, 
is zero. 1 

We shall now show that this condition, combined with the 
condition that R x is a tangential vector (i.e. belongs to to^, 
completely determines the vector R v so that we may take as 
the differential definition of parallelism with respect to a surface 
the following geometrical relations, in which n denotes the normal 
to to: 

dR |j n, (18) 

R 1 || TO j. 

To prove this, note that the equation 
R = R x — dR 

must be satisfied; i.e. that it must be possible to resolve the 
vector R into one component R x parallel to a given plane and 

1 The last-mentioned ease will occur if R has the direction of W, i.e. the con- 
jugate of PP X ; in this, and only in this ease, the parallel] Ei wjth respect to the 
surface coincides with the Kuclidean parallel. This remark is due to Professor E. 
Bompiani, who has made use of it to generalize the theory of systems conjugate to 
surfaces belonging to non-Euclidoan spaces ; cf. Atti dc( /?, ht. VciwUt, Vol. hXXX., 
1921, p. H2Q T 
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another, — dR, parallel to a given direction not contained in 
the plane; it is known that this can be done in only one way. 1 

13. The intrinsic character of parallelism. 

Returning to the question of parallel displacement along an 
arc T of finite length, we see at once that if T is a segment of 
a geodesic, parallelism depends solely on the intrinsic properties 
of the surface a\ i.e. it depends on the nature of the linear element 
ds, and not on the configuration of the surface in space, as might 
a priori have been supposed from the geometrical construction 
(which uses the surrounding space) or the equivalent formulae 
(18) and R x J| td^ 

In fact, we need only recall the two general properties of the 
conservation of angles and the autoparallelism of geodesies. 
The parallel u L at P x to a generic direction u drawn from V 
is determined by the conditions of (a) belonging to the surface 
a, and (6) of making the same angle at P^ with the geodesic of 
displacement as u does at P. It will be seen that we are dealing 
with angular properties which depend solely on the metric of cr. 

This argument for a geodesic T can easily be extended to 
the general case, if we suppose T divided up into elementary 
displacements, from a generic point P to a very near point P v 
In a displacement of this kind the elementary change in the 
direction u is determined, as we have seen, by the extremities 
PPj, the nature of the line joining these extremities has no 
effect, and we may therefore think of it as a displacement along 
an infinitesimal segment of a geodesic. But a displacement of 
this kind depends only on the intrinsic properties of the surface; 
hence we see that in general this is true also for the change in 
u, and therefore for parallelism, whatever may be the line of 
displacement. 

The same result holds good for equipollence, i.e. for the 
displacement of vectors of any (non-unit) length whatever. 
In fact (§ 11), this length by definition remains unchanged. 

1 Some interesting geometrical consequences, especially for the case of ruled 
surfaces, have been pointed out by A. Myllek in some notes in Comptes 
Ilendus ; of. Vol. 174 (1922), pp. 997-998 ; Vol. 175 (1922), pp. 939-941 ; Vol. 176 
(1923), pp. 483-485. Cf. also a recent note by O. Maykr: "line interpretation 
geometrique de la secotide forme qnadratique fond am en tale d'une surface en 
relation avec la th^orie du parallelisme ibid., Vol. 178 (1924), pp. 954-956. 
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14. The symbolic equation of parallelism. 

The condition (18) can be put in a more expressive form if 
we note that it is equivalent to saying that the vector dR is 
perpendicular to every direction which is tangential to a at P, 
or in other words, if we think of such a direction as being deter- 
mined by an infinitesimal displacement of the point P along 
the surface, that dR is perpendicular to all these displacements. 
In symbols, if SP denotes the infinitesimal vector representing 
the displacement, we shall have 

dR X 8P = 0 (19) 

for any SP whatever which is tangential to a -- an equation similar 
in form to the equation of virtual work. If dY v (v = 1, 2, 3) 
denotes the components of dR, and hy v (v — 1, 2, 3) the com- 
ponents of SP (in both cases referred to the orthogonal Cartesian 
co-ordinates y v y 2s y 3 ), we have identically 

dR X SP = i v dY lf hy v , . . . (20) 
i 

and the vectorial relation (19) is thus transformed into the scalar 
relation 

i v dY v hy„ = 0; .... (39') 
i 

this, or the original equation (19), may be called the symbolic 
equation of parallelism. 

15. Intrinsic equations of parallelism. 

As the symbolic equation involves geometrical elements which 
do not belong to the surface, it does not show directly that paral- 
lelism is a concept depending only on the intrinsic properties 
of the surface. But we can deduce from it without much diffi- 
culty other equations which have this important characteristic. 

In order to do so, we shall naturally try to find the values in 
terms of intrinsic elements of the quantities dY lf and Sy t , which 
occur in equation (19'). Take first the displacements 8y r . The 
only condition imposed on them — other than that of being 
infinitesimal— is that they represent a displacement along the 
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surface of cr; they can therefore be expressed in terms of the 
corresponding (arbitrary) variations 8x v 8x 2 of the surface co- 
ordinates, by differentiating the equations (1). We accordingly 
have 

8y v = X k l y »8x k . 
i dx k 

As the vector R is tangential, we can define it intrinsically 
by means of its contravariant components, and substitute for 
the YJs the expressions (12). 

Putting 




for the sake of shortness, the identity (20) can therefore finally 
be written in the form 

2 

rfR x 8P = X k r k 8x k ; . . . (20') 
i 

since 8x l9 8x 2 are completely arbitrary, it follows that the symbolic 
equation of parallelism is equivalent to the two following equa- 
tions: 

r k 0 (k = 1, 2). ... (22) 

These are the two equations which define the increments 
dR l , dR 2 to be assigned to the components of a generic vector 
R when it undergoes a parallel displacement along the elementary 
path dx x , dx 2 ; that they are really intrinsic equations will be 
clear when the expressions r k are written out in full, as will now 
be done. 

Differentiating the product on the right-hand side of the 
equations (21), and using the expression for the coefficients a ik 
given in formula (3), the expression for r k becomes 

r k = iijan<m + s„ 4 R'ty ~y»-d Xl , 

i ii ox k dx J dx L 

or r, ^ X, a„ dR> + fl' dx t Z„ d %. . (21') 

l l' l OX k OXjOXi 



We have now to show that the result of the summation with 
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respect to v can be expressed in terms of intrinsic elements; 
more precisely, that it is a linear combination of the derivatives 
of the coefficients a ik . Consider its general term, and note that 
we can write 

dx k dxjdxi dxj \dx k dxj dx j dx k 9a?/ 
or analogously 

d Vv &y v _ d /dy v dy\ d 2 y„ dy v 



fiy* %y»\ _ _d*i 

\dx k dxj dx t \ 



dx k dxjdx t dx t \dx k dxj dx t dx k dxj 

In order to maintain symmetry in the indices j and Z, we shall 
take half the sum of the expressions on the right of these equations 
to represent the value of the term in question. Noting that the 
sum of the two terms preceded by the minus sign is exactly the 

derivative with respect to x k of the product we get 

dxjdx t 

dy» d 2 y, ^ 1 r /3y v 3yA + 9 fiy v 3yA _ 9 / s ^< 0 ^\1 

dx k dxjdx t * [dXf \dx k dxj dx l \dx k dxj dx k \dx*dxj J 

Now sum with respect to v. Remembering the values of the 
coefficients a ik , we get 

\ v dx k dxjcxi Ldx, dx t dx k j 

Hence this sum has been put in the required form. The 
right-hand side of this equation is represented shortly by the 
symbol 

(ChristoffeVs sytnbol of the first kind); which is easy to remember, 
the arrangement of the indices corresponding to that of the 
negative term of the linear combination above, while the two 
positive terms have the same indices but differently arranged. 
We shall investigate presently some properties of these symbols; 
for the moment we need only remark that they represent certain 
functions of the surface co-ordinates x lJ x z which depend only 
on the fundamental quadratic form. 
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Beturning to the expression (21') for the quantities r ki we 
can now write it in the form 

2 2 

r b = S, a kj d& + 2, 7 [jl, k] R> dx t (k - 1, 2). (21") 

1 1 

Before continuing the argument, it is important to note that 
the quantities r /9 which (as shown by equation (21') ) depend 
on two vectors (R and the displacement dx x , dx 2 ) as well as on 
the coefficients of ds 2 and their first derivatives, are covariant. 

This follows from the in variance of the linear form H k r k dx k , 
which is itself shown by the identity (20'). 1 
The system which is the reciprocal of the 77/s, namely, 

t' ; =-- ± k a ,b r k (i = 1, 2), 

1 

is accordingly contravariant\ using equation (21"), it can be put 
in the form 

2 2 
t' = dR l + R> dx t a Uc [jl, Jc], 
1 1 

2 

or, putting S A a a ' [jl, k] {jl, i} 

1 

(ChristojfeVs symbol of the second kind), in the form 

t 1 = + 4i { i J # rf^. . . (21"') 

j' 

The expiations of parallelism (22), as is a priori to be expected 
from their geometrical significance, are invariant whatever system 
of curvilinear co-ordinates x ly x 2 is chosen. This is evident from 
the fact that they express the vanishing of the covariant system 
r h . (cf. remarks on pp. 71, 84). The equations of parallelism can 
of course also be put in the equivalent form 

t* = 0 (i = 1, 2), . . . (22') 

which also shows that they are invariant. 

Solving them for the differentials dR l , we get 

dR' = — Zjt {jl, i}R } dx t (i = 1, 2). (23) 
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in 



This is the final form of the differential equations of paral- 
lelism. It gives the increments of the contravariant components 
of a surface vector in an equipollent displacement along the 
elementary path dx £i expressed in terms of the dxjs, the com- 
ponents of the vector, and certain functions of position (to be 
considered as given) depending only on the coefficients of ds 2 
and therefore on the intrinsic nature of the surface. 

1 6. Christoff el's symbols. 

We have introduced the symbols 

(jl, t) = & t o*Ul,kl . . . (25) 

1 

which can also be formally extended to quadratic forms in n 
variables; we now propose to examine their more elementary 
properties. 

First, it is obvious that both symbols are symmetrical with 
respect to the coupled indices, i.e. that 

= Pi.*]. {jhi) - 

Consequently for a form in n variables there are n of each 
kind corresponding to each pair of indices. Hence there are 
in all Jw a (w + 1) of each kind (the number of first derivatives 
of the coefficients a ik ). 

It is easy to express the derivatives of the a tfr 's in terms of 
Ghristoffel's symbols. Writing down equation (24) and the 
corresponding equation obtained by interchanging / and and 
adding them, we get the following formula, which frequently 
occurs: ~ 

v** - Ci'. *] + fi*. q. . • . (24') 

OX f 

From equation (25), applying Cramer's rule in the usual 
way T we can get the symbols of the first kind in terms of those 
of the second kind. Multiplying by a tm and summing with respect 
to i, we get in fact 

[jl, m] - S £ a im {jZ,i}. . . . (25') 
i 
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Lastly, we shall prove a formula which is frequently used 
and gives the derivatives of the determinant a (or more precisely 
of its logarithm) in terms of Christoffers symbols. 

Applying the usual rule for differentiating a determinant of 
order n, we see that the derivative of a with respect to any one 
of the x's (say x £ ) is the sum of n determinants, any one of which 
(say the Jfcth) is obtained from a by replacing the elements of 
the &th row by their derivatives. A determinant of this kind, 
expanded from the kth. row, can be written in the form 

i J dx 

(the co-factor of a jk being a Jk multiplied by a); hence 

d l = ?: k da > k a> k a, 

or dividing by a, and using formula (24'), 

Finally, by formula (25), wc get 

ox i i i 

The two sums in this formula differ only in the letter chosen 
to denote the index of summation; hence we have 

dx { i 

This formula is more frequently written in the form obtained 
by dividing by 2, i.e. 

- 2* [hi, k} (i = 1, 2, . . . n). . (26) 

17* Equations of parallelism in terms of covariant components. 

It is easy to find equations analogous to (23) for the differentials 
of the covariant components of the vector R. These components 
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in fact are obtained from the contravariant components by means 
of the relations (cf. §§ 3, 6, pp. 90, 96) 

Ri = S, «»<,«'; 

1* 

hence, differentiating and changing j into k in the second sum, 

dRi = in d " ,J dx t ft + ± k a ik dR k . 
« 1 ox t 1 

Now substitute for dR k the expression given for it by formula 
(23), and we shall have 

dR t - £ }l d fS> dx t ft - i jUe a Lk {jl, k) ft dx L . 

X VXi 1 

In the first sum, we can express the derivatives of the co- 
efficients a tJ in terms of the symbols of the first kind, so getting 

S„ + [UJVdxiW; 

in the second, we can sum with respect to k (cf. formula (25' ) , 
so getting 2 

— ^[jhqftdxt. 
i 

We thus have 

dR h i: M [U 9 j}dx l R J . 
i 

In order to make the contravariant components disappear 
altogether, we substitute for R' from the formula 

ft = X k a*R k ; 
i 

summing with respect to j (which, by formula (25), changes the 
symbol of the first kind to one of the second kind), we get 

2 

dRi = 2 A j { ^7 J k ) R k dx t . 

Finally, changing k into j in order to show more clearly the 
analogy with the equations (23), we have the equations 

dR t k jt {UJjRjdxc (i 1, 2), . (27) 

( I> 655 ) 5 
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which are equivalent to (23). They are in fact the result of com- 
bining certain formulae and identities with the equations (23); 
and reciprocally, starting from (27), an analogous process will 
give (23), as can easily be verified. 

18* Some analytical verifications. 

We are now in a position to give an analytical proof of some 
properties of parallelism which have already been obtained as 
immediate consequences of the geometrical definition. 

Consider first the parallel displacement of a vector R along a 
finite segment T of a curve, from P to P v Let the curve be 
defined by the parametric equations 

a* = *<M, (28) 

where s represents any parameter (which may, if we w'mh, be 
the length of the arc measured from an arbitrary origin P 0 ). 
The quantities R are to be considered as functions of s with 
arbitrarily assigned values at P. The equations (23), divided 
by ds, become 

R! = -^{jlij&xt (i = 1,2), 
i 

where the dot indicates differentiation with respect to s, and 
the quantities x t are of course obtained by differentiating equa- 
tions (28), and are therefore to be considered as given functions. 
These are two linear differential equations of the first order, 
in the normal form with respect to the derivatives of the two 
unknown functions J? 1 , R 2 ; hence, as is known from the calculus, 
they uniquely determine these two functions when the (arbitrary) 
initial values are given. We have thus a confirmation of the 
geometrically obvious fact of the possibility of displacing an 
arbitrarily assigned surface vector, and of the uniqueness of tJte 
result. 

Using the differential equations already found, we shall now 
prove that the length of a vector and the angle between two vectors 
are unchanged by a parallel displacement. These two results 
can be proved simultaneously, as follows. Let R, V, be two 
vectors. Give them a parallel displacement along an infinitesimal 



DIFFERENTIAL QUADRATIC FORMS 115 



path, and calculate the change in their scalar product due to 
this displacement. We shall have (cf. formula (14) ) 

d(R X V) = it&dVi + i t VidR 1 ; 
1 1 

substituting for dR % and dVt from (23) and (27), this becomes 

2 2 
<Z(R X V) = S 07 & {il, j\ Vj dx L - S w F, { jZ, i} # 

Interchanging i and j in one of the two sums, we see that the 
sums are equal, and therefore 

d(R X V) = 0; 

i.e. the scalar product is unchanged by an infinitesimal (and there- 
fore also by a finite) displacement. Now let V coincide with 
R, so that R X V — R 2 , and we at once obtain the result 
that the length of a vector is unchanged by a parallel displace- 
ment. Combining this result with equation (13), we see that 
as the scalar product of two vectors and their respective lengths 
are all unchanged, the angle between them (provided neither 
vector is of zero length) must also remain the same. 

19. Permutability. 

While a tangential vector is intrinsically defined by two 
numbers, the geometrical notion corresponding to it, as we have 
already said, is a segment of a tangent line at a point P of the 
surface a — an entity which does not belong wholly to a, t at least 
in general. If, however, we are dealing with an infinitesimal 
vector, the element of the tangent plane in which it lies coincides 
with the element of the surface a around P, and we may say that 
we are using only points lying in a. Hence, for a generic infini- 
tesimal tangential vector we can use the ordinary notion of a 
displacement from the origin P to the final point P t , where 
P x also lies on a. As the length R reduces in this case to a linear 
element d$, it follows from the definition of direction parameters 
that the quantities R', which are equal to A'ds, are identical with 
the increments dx t of the curvilinear co-ordinates in passing 
from P to P v 

Next, consider two systems of differentials dx„ 8x it and the 
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corresponding infinitesimal vectors (or displacements) dP — 
PP X , SP = PP 2 (assumed to lie in a). We shall use the symbol 
df to denote the increment of / (where / is a generic vector or 
any scalar or vector quantity derived from it) corresponding 
to a parallel displacement from P to P x ; the symbol 8/ 
will be defined in the same way for the displacement from 
P to P 2 . 

With this convention d8P will represent the vectorial incre- 
ment of SP for a displacement from P to P x , and d8xt the incre- 
ment of the associated contravariant system Sa? £ . For the latter, 
equation (23) gives 

2 

dZx i = — 2^ {jl, i}8x j dx l (i = 1,2). . (29) 
i 

Similarly, the displacement of dP from P to P 2 gives the 
increments 8dx h for which we have 

2 

8*Za? £ = — {j7, i} dXjSx t . . . . (29') 

Interchanging j and Z in one of these two sums, and using the 
property of symmetry of Christoffel's symbols, we see that 

d8x L = 8dx h (30) 

which proves that the two operators d and 8, as just defined, are 
perm,vtable. 

The geometrical meaning of this result is particularly simple. 
Note first of all that for infinitesimal vectors — the only kind 
considered here — the elements of the contravariant system are 
merely the differences of corresponding co-ordinates. Hence, 
if the co-ordinates of P are the x/s, we shall have in the first 
place x L + dx^ as the co-ordinates of P 1% and x i + 8x t as the 
co-ordinates of P 2 . Let Q be the point on a reached by construct- 
ing at Pj the vector equipollent to 8P; as the contravariant 
system for this vector is 8x t +- d8x h we get finally 

x t + dxi + 8x £ + d8x t 

as the co-ordinates of Q. 

Similarly let be the point on a reached by constructing 
at P a the vector equipollent to dP; we get the co-ordinates of 
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Q* by interchanging the operators d and 8 in the co-ordinates of 
Q, which gives Xi + &Xl + dX( + M x, 

Applying equation (30), we see that Q coincides with Q*. 
A more illuminating way of expressing the same result is to say 
that the parallelogram rule holds for infinitesimal vectors which are 
equipollent with respect to a surface. 1 

It may be noted that in the foregoing argument second -order 
quantities of the type dbx have been taken into account, but 
{dx,) 2 , (Sx t ) 2 have been neglected. If the latter were to be taken 
into account, by considering the vectors 8P, dP and the equi- 
pollent vectors at P 2 and P x as vectors in space, we should no 
longer have a parallelogram, nor even a closed quadrilateral. 
In fact, referring to the space construction already given (cf. 
p. 105) for vectors equipollent with respect to a surface, we see 
that while dSP and MP are both in the direction of the normal 
to a at P, yet their lengths are in general different, since the 
three points P, P A , P 2 and their respective tangent planes have 
a priori no relation between them except that of being infinitely 
near one another. 

The formula? (29) or (29') provide a definition of the second 
differentials which is invariant with respect to any change of 
variables. In order to grasp the significance and value of this 
fact, we must recall the conventions as to second differentials 
which are adopted in the elementary theory of the calculus. 

To fix the ideas, consider the simpler case of a single inde- 
pendent variable. Ordinarily the convention d 2 x 0 is adopted; 
i.e. the increments dx are considered independent of x } as is 
quite legitimate. But this simplification does not hold if we 
change the independent variable by putting x /(f), from which, 
on the hypothesis that we have a reversible transformation, we 
can reciprocally find f as a function F(x) of x. In fact, differ- 
entiating twice the formula f — F(x), we get 

d£ = F'(x)dx, 

d?£ == F"(x) (dx) 2 + F'(x) d 2 x, 

1 This property might be taken as the starting point for an intrinsic proof of 
the properties of parallelism, depending only on the metric of <r, and making no 
uso of the surrounding space. The method can be applied directly to manifolds 
V n of any number of dimensions. Cf. H. Wetl, Jiaurn, Zcit, Materie^ §14 (Berlin, 
Springer, 1923). 
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which shows that even if we make d 2 x 0, d 2 £ will not in general 
be zero. 

If then there are n variables, it is usual to consider only 
systems of differentials which are completely independent of 
the variables x t , so that we have not only d^x i = 0, but also, 
for any two systems dx h 8x l of these differentials whatever, 

d8x t = 8dx t -~ 0 (i = 1, 2, . . . n). 

Now change the variables, by putting x t — fi(x), and there- 
fore x, =5 ^X^)- Using the condition 8dx t = 0, we get 

so that the property 

8dx L - <2Si;, ; 

also holds, but these differentials will not in general be zero. 
The usual convention is therefore legitimate, and is suggested 
by obvious reasons of simplicity, when in a given question we are 
dealing always with the same variables; but it is not invariant 
for a change of variables. 

if instead we adopt the formulae (29) and (29'), and suppose 
that 

d8x, = 8dXi ■= - S /Z {jl %} 8x j dx h . (31) 
i 

we get, for the same geometrical interpretation of this formula, 

2 

d8x £ = Sdxi --- — E, 7 i) 8x } dx h 
i 

where the line above the letters denotes that Christoffel's symbols 
refer to the variables x, i.e. to the transformed quadratic form 

2 

ds 2 = ^a^dxidx^ 
i 

We could of course verify by direct substitution that the 
form of the expressions (31) is unchanged by the change of 
variables. We are in fact dealing with an immediate corollary 
of the invariance of the equations t' — 0 (cf . § 1 5), which follows 
at once by putting R l = 8x t in these equations. 
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On account of this invariant property the second differentials, 
defined as in (31), are called contravariant, although strictly 
speaking the term applies not to them but to the expressions 

ddxi + 2^ { jl, i } Sxj dx h 
1 

which in any case (see § 15) constitute a simple contravariant 
system. 

(c) Extension of the Foregoing Notions to w-dimensional 
Manifolds of any Metric 

20. n- dimensional manifolds. 

Alongside the extension of the use of geometrical terms which 
was developed in Chapter 1, we shall now introduce, on the lines 
of the discussion in subdivision (a) of this chapter, the fundamental 
notion of an n-dimensional metric manifold, where n is any 
integer. 

If there are n variables x x , x 2 , . . . x„, we know that the aggre- 
gate of values which can be assigned to them is called an n- 
dimensional manifold. Now suppose that together with these 
variables and their field of variation there is also given a priori 
a diffeiential quadratic form 

n 

ds 2 = £, /r a ik dx, dx k , .... (32) 
1 

in which the coefficients a tl . are given functions of the jz's, and 
a lk ~ a k; . We shall agree to consider ds as the distance between 
the two infinitely near points whose co-ordinates are x l9 x 2 , . . . x n 
and x x + dx l% x 2 + dx 2 , . . . x n + dx n ; we shall in consequence 
agree that ds is to be invariant for any change of co-ordinates. 
Having thus introduced into the manifold the notion of an 
elementary distance, we get from it at once by integration the 
notion of the length of a line, and also deduce from it, as we shall 
see, the most direct criteria for defining all the properties of 
extension (angles, areas, volumes, &c). 

A manifold with which has been associated a quadratic form 
of the type (32), or in other words, a manifold whose metric is 
given, is called a metric manifold, and will be here denoted briefly 
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by F n . Since ds 2 is invariant, the coefficients a ik obviously form 
a symmetrical covariant double system; we shall throughout 
the discussion suppose that they and their first and second 
derivatives are finite and continuous functions, and so chosen 
as to make the quadratic form definite and positive. 1 Thus the 
distance between two real points will always be real; the deter- 
minant a of the coefficients a ik will always be positive. With the 
usual notation the reciprocal elements will be denoted by a 1 *, 
&c. 

We shall now extend the concept of direction to a generic 
F M . We shall consider direction as determined by two infinitely 
near points, i.e. by a system of dx'&. As before, we shall apply 
the term parameters to the n contravariant quantities 

A 1 - d p (i 1, 2 n) 

ds 

which define a direction (and are uniquely determined by it), 
and we shall apply the term momenta to the covariant quantities 

n 

A; --= E k a A \ 1c {i — 1, 2, ... n). 
i 

Thus for any value of n we have again two simple systems, 
reciprocal with respect to ds 2 , or to the form (32) (cf. p. 96, 
Remark III). 

The parameters are connected by a relation completely 
analogous to (5), and the formulae (5'), (5"), and (C) can be 
extended without difficulty, the summations being now from 
1 to ft instead of from 1 to 2. The aggregate consisting of a direc- 
tion and a positive number R will be called a vector R in a V n 
(R being the magnitude of the vector); the products of R by 
the parameters of the direction will be called the contravariant 
components R\ and the products of R by the moments the 
covariant components R im We shall then have a set of formulae 
analogous to (11), (11'), (11"). 

Suppose the x's expressed as regular functions (i.e. finite and 
continuous, together with all their derivatives which enter the 

1 At the end of the chapter (p. 141) we shall also consider shortly the case of 
an indefinite quadratic form. This case was at first neglected as offering little 
likelihood of useful application, but the theory of relativity has now invested 
it with very great importance. 
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discussion, in the field considered) of p parameters . . . Up, 

where p is a positive integer less than n: 

*i — «2» • • • «V) — 1, 2, . . . n). . (33) 

We shall make the hypothesis that at least one set of p func- 
tions/is independent, i.e. that p is the characteristic of the func- 
tional matrix of the f's with respect to the w's. Hence the x's 
are connected by n — p relations and no more, namely, those 
which we should get by eliminating the w's from equations (33). 
In this way we define a subordinate ^-dimensional manifold 
W pi whose co-ordinates are the u 9 s. W p is said to be contained or 
immersed in V n , since to every system of p values assigned to 
the u'& there corresponds, by (33), a system of n values assigned 
to the cc's (i.e. every point of W p belongs to V n ), while not all 
the systems of values which can be assigned to the sc's satisfy 
the equations (33) (i.e. not all the points of V n belong to W p ). 
Now, remembering the analogy with the case n — 3, p =^ 2 
(cf. p. 87), we naturally assign to the distance between two points 
of the subordinate variety the same value (32) as that of the 
distance between the same two points when they are considered 
as belonging to V n \ i.e. we construct ds 2 for the subordinate 
manifold by substituting in (32) for the dx's their values obtained 
by differentiating the equations (33). In this way we can easily 
find the coefficients of the fundamental quadratic form in the 
du's, and the metric of the /^-dimensional manifold W pi immersed 
in V tl , will be completely defiued. For p = 1 the definition 
coincides with that given in Chapter I, § 1 , for a line, of which the 
equations (33) are the parametric equations. 

If p z= n — 1 , the W p is often called a surface, or more pro- 
perly a hypersurface. 

21. Euclidean manifolds. Any V n can always be considered 
as immersed in a Euclidean space. 

If ds 2 reduces to the sum of the squares of the differentials, 
as in the case of orthogonal Cartesian co-ordinates, the quadratic 
form is said to be Euclidean, and the co-ordinates, by an obvious 
analogy with the elementary cases n — 2 and n = 3, are called 
orthogonal Cartesian co-ordinates. When this is so, all Christoffel's 
symbols obviously vanish identically, since the coefficients a# 

(D 655) *• 
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are constants. Given a generic V n9 and therefore a generic cfe 2 , 
it is not in general possible to bring about a change of variables 
such that ds 2 takes the Euclidean form, or in other words to 
establish a system of Cartesian co-ordinates in V„; if it is possible 
V n is called a Euclidean manifold, and we shall denote it by 
S n . We shall find later on the conditions to be satisfied by the 
a^'s in order that V n may be Euclidean. V n9 however, can always 
be considered as immersed in an iNT-dimensional Euclidean variety, 
where N > n, as we shall now show. 

We propose to determine N functions of the x's, 

y 2 (x), . . . y N (x), . . . (34) 

such that when we differentiate them and take the sum of the 
squares of the differentials we get a form, quadratic in the dx's, 
which is identical with the given ds 2 , so that we have identically 

N n 

s ,<ty„ 2 = <*ik dx t dx h . 
1 i 

Expressing the dy'a in terms of the dxs, we have 

ls4'- I******* - ha^dxtdx,/ 
i i dx L dx h i 

or, equating the coefficients of dx L dx k , 

£ dy, dy„ _ ^ (i, k — 1,2 n). (35) 

1 ox t ox k 

We have thus obtained \n(n + 1) partial differential 
equations of the first order in the N unknowns y\ unless any of 
these are mutually inconsistent (and a more detailed discussion 
would show that this is not so) we deduce that the problem is 
soluble for N = \n(n +1), and a fortiori for N > %n(n +1). 
The s/'s can evidently be considered as Cartesian co-ordinates in 
a Euclidean manifold (space) S N m which the given V n is immersed, 
V n being parametrically represented by the values of the y's 
in (34) (cf. formula (33) ). It is therefore possible to immerse a 
generic V n in a Euclidean space provided N > hn(n +1). 
For particular F n 's, however, a smaller number of dimen- 
sions may suffice; e.g. for a Euclidean V n > n dimensions are 
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sufficient; in this case the y 9 & are Cartesian co-ordinates of V n 
itself. 

If N has the smallest possible value, the difference N — n 
is called the class of the V n . Since the minimum N is not greater 
than \n{n -f- 1), the class cannot be greater than %n(n + 1) — ^, 
or ^n(n — 1). Further, N cannot be less than n, 1 and therefore 
the minimum value of the class is 0. For n =-= 2, the class is 1 , 
which shows that every binary ds 2 may be considered as belonging 
to an ordinary surface. In other words, the parametric expressions 
which were our starting point (p. 86) impose no restrictions on 
the study of the intrinsic properties of a ds 2 in two variables. 

22. Angular metric. 

We shall now extend to the generic V„ the notion of the 
angle between two directions. The most direct method is by the 
formal extension of formula (8) (and its equivalents) by summing 
from 1 to n instead of from 1 to 2; this however will be legitimate, 
if we wish to avoid imaginary values of ft, only when we have 
shown that the expression on the right < 1. 

In order to do this, we shall examine some algebraic properties 
of quadratic forms. 

Let , " 

1 

be a definite positive quadratic form. Suppose that the z*& are 
linear combinations of two different systems of non-proportional 
variables, so that we may put 

*i = As, + py t ; 

we therefore have 

n 

4> a ---- a* (Ax f + /*y 4 ) (Ax* + py k ) 

= S t . a lk [A 2 x L x k + A/x (x t y k + y, x iC ) + V. v\ 
1 

n 

1 A quadratic form <f> = Uifc "t*£i&i» called irreducible when the number of 
1 

independent variables cannot be reduced by substituting for the £ k linear com- 
binations of them. This is always so when the form is definite, as in this case the 
determinant a of the coetiioients is certainly not zero (p. 90). A cannot therefore 
be less than n. 
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Splitting up the right-hand side into three sums, and putting 

ii 

^ik^ik ®i x k — <f>xx> 
1 

n ?i 

^tk a ik^yk — ^ik a tkyt x k = <f>x„ = <f>„xy 
i i 

n 

^ik^ikViVk = <f>yy, 

we have finally 

A. = + 2 A,*** + . . (36) 

This may be considered as a quadratic form in A and /a; it 
is easy to show that it is definite and positive, i.e. that it is 
always greater than 0 when A and /x are not zero. In fact, <f> zz , 
considered as a quadratic form in the z's, is always positive, 
provided at least one of the z's is not zero; and this condition 
is equivalent to our hypothesis that the x's and y'a are not 
proportional. 

From (36) we therefore get 

A 2 ^ + 2 \tx<f> xu + ^4> inf > 0, 

whatever A and /z may be. 

Hence, from an ordinary property of quadratic inequalities, 
we get 

4>xx4>yy — 4>lv > °> .... (37) 

which is the formula we wished to prove. 

We now return to the proposed formal extension of formula 
(8). What we have to prove is that 

/ n . v 2 

(s ifc a*AV) < 1, 

i.e. that , v - A 2 . ft 

1 — ^S t *a tft A>M > 0, 

whatever A* and /i l may be, provided they are not proportional 
(since we exclude the obvious case where the directions coincide 
or are opposite). 

This inequality can now be proved at once. Introducing the 
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quadratic relations between the parameters, we oan write it in 
the form 

(Sat^A'A*) (ip^/iV) - (i+a* A>*)* > 0; 

and this is merely (37), with the z's and j/'s replaced by the A'^s 
and /it's. 

We may therefore assume 

cos& = £*a a AV. • • - • (38) 
1 

and the other expressions equivalent to it will also hold good, 
namely, 

cos* 2, A> t , (38') 

1 

cos* = £, A, /x 1 , (38") 

1 

cos* = 2^a' 7 'A^, . . . (38'") 
i 

in which the moments (cf . § 20) of one or both directions take the 
place of the corresponding parameters. 

In the provisionally excluded case of two coincident or opposite 
directions (A' -~ + fO* we must naturally agree that cos* = + 1. 
With this convention the four formula? just given still hold 
good; the right-hand side in each case also reducing to + 1 in 
virtue of the fundamental relations 

ZaOfrA'A* = S.A.A' = S i7f a*A t A, = 1 

1 1 1 

between the parameters and moments of a single direction 
(cf. §20). 

Now consider our V n , immersed in a Euclidean space S N . 
Given two directions X, fx belonging to V u9 and drawn from the 
same point, the angle between them is defined in two ways, 
since the directions X, |jl may be specified either by their para- 
meters A*, fji 1 relative to V„, or by their parameters A'", [x v relative 
to S N } and formula (38) may be applied to either set. We 

1 We may note in passing" that in a Euclidean space, referred to rectangular 
Cartesian co-ordinates, the parameters of a direction coincide with its moments; 
also (as follows directly from the properties of linear orthogonal substitutions) the 
formulae of covariance are identical with those of contravariance (cf. § 3, p. 67). 
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shall call the angle calculated in the first way and in the 
second &', and we shall show that cosQ- = cos&'. 

Remembering that for the Cartesian co-ordinates y of S N9 

ds 2 has the form 2 dy* we have 

n 

cos& = S^a^A*/** 
i 

cos$' = 2 r A'V- 
j 

Now the parameters A'", /a'" are given by the formula* analogous 
to (7), (7') 

rfs i* dx t ds i ' 3 :c t 

OS l 0X A OA? j CX k 

We have therefore 

i i d d x /r i \ vx i a x k 

and therefore by (35) 

cos ft' = E# A ? /x 7r a ik = cosO-. 

Q.E.D. 

Now consider two vectors R, V, whose directions are X, (X 
respectively. We can extend the definition of the scalar product 
by giving this name to the invariant 

R X V =- RV cos&, 

where & is the angle between the two directions. For each of 
the various expressions for cosS- we shall get a corresponding 
expression for the scalar product by multiplying (38), (38'), 
(38"), (38'") in turn by RV. The resulting formulae are: 

R x V = a lk R l V k - 2. & V, = S, R, V 1 - X ik a* R, V k . 
i ill 

If one of the vectors, say V, is of unit length, we shall call 
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the product R X V the "projection of the vector R on the direc- 
tion determined by the unit vector V, or its component in this 
direction. 

The orthogonality of two non-zero vectors is evidently ex- 
pressed by the vanishing of their scalar product. 

We can now make some useful remarks relating to certain 
particular directions. Let a £ denote 1 the direction of the co- 
ordinate line i (i.e. the unit vector in that direction, in the sense 
of the xjs increasing); remembering that for a displacement 
along the line i we have dx t ---- 0 for r =1=" % and ds = %/ a u dx l9 
we see that the parameters s]' of the direction s t are all zero 
except the ith, so that we have 

< -= 0 (r 4= 0, *; = ) - 

v a t 

On the other hand, the direction n ; of the normal to the co- 
ordinate hypersurface Xj = constant (the normal meaning the 
direction perpendicular to any direction drawn on the hyper- 
surface) has its moments n } , , all zero except the jth. For n,- must 
be perpendicular to each of the n — 1 directions s- (i =fc j), so 
that applying formula (38') to the values just found for the 
parameters of we can write 

n j\i )— = 0 (*4=i), 

whence — 0 for ?"4=j. The value of n, (y is therefore deter- 
mined by the quadratic identity between the moments, which 
gives 

^ " Ja«' 

if we suppose that the sense of is that of the x/s increasing; 
for the opposite sense the radical must have the minus sign. 

That the direction n, so defined (at a generic point) is actually 
perpendicular to any direction X (through the same point) on 
the hypersurface Xj — constant, follows from the fact that for 
every such direction the parameter X j is zero, and therefore 

Sin,,, A* « 0. 

1 The Butfix i is not of course an index of co variance. 
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Applying the above remarks, it can at once be seen that the 
angle tu^ between the co-ordinate lines i and k is given by the 
formula 

Cbijt 

COSO).^ = — y.=^L=-_ , 

while the angle a ik between the co-ordinate hypersurfaces x { = 
constant and x k = constant (i.e. the angle between their normals 
^ and n /r ) is given by 

_ a ik 

COS** - JjQck 

These formulae show the real meaning of the coefficients of 
ds 2 , and the geometrical interpretation to be given to their 
vanishing. 

We shall now try to find the geometrical meaning of the 
covariant and contravariant components of a vector R. For 
this purpose we shall calculate the orthogonal projections of 
R on the directions and n t . We get for these 

n 

R X Si = ^ r R r s r i = 
1 

n 

Rxn^ ^ r BTn %lr 

These results show that 72, and R l represent the projections 
of the vector R on the co-ordinate direction i and on the normal 
to the co-ordinate hypersurface x i — constant, multiplied by 
\/ a it and *J a n respectively. 

23. Definition of geodesies. 

We shall fix any two points A; B in a generic V n , and we 
shall try to find the shortest of the lines which join A and B. 
In a certain sense this problem is analogous to that of finding 
the points at whicli a function is a maximum or minimum, the 
solution of which is of course an important application of the 
calculus. Here, however, we are not trying to find points, and 
hence the values of one (or in general of n) variables which 
satisfy the required condition; we are trying to determine a 
line, and hence, analytically, to determine the form of n functions 
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(the parametric equations of the line). The problem is therefore 
of a higher order of difficulty: while the former led to equations 
in finite terms, the latter leads to differential equations. The 
solution of this problem and of others related to it is the principal 
object of the calculus of variations. We shall recall shortly the 
fundamental idea of this calculus, which does not differ in prin- 
ciple from the idea which leads to the solution of the other more 
elementary problem of the maxima and minima of a given 
function. 

To fix the ideas, we shall suppose that there is only one 




M 



X 



variable. We know that if a function f(x) has a maximum or 
minimum at x 0 , its differential df — f'(x 0 )dx is zero at that 
point (and therefore /'(# 0 ) = 0), whatever dx may be; in other 
words, for an infinitely small displacement to left or right from 
the point x 0 , f remains constant (except for infinitesimals of 
higher order). This can also be seen intuitively from the graphical 
representation of the function (cf. the points M and N in the 
diagram). The converse, however, is not true, i.e. when df — 0 
it does not necessarily follow that there is a maximum or 
minimum (cf. for instance the point P in the diagram). The 
maxima and minima must be looked for among the points where 
df=0. 

Let us now see how we can apply this method to the deter- 
mination of the shortest line joining A and B, without going 
outside a given V n (we may think of a line drawn ou a surface, 
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i.e. the case n ~~ 2). Let g be such a line; draw a line g\ having 
the same extremities as g 9 and infinitely near g, but otherwise 
completely arbitrary. We can consider g f as derived from g 
by an infinitesimal deformation, i.e. by displacing each point 
(p,, x 2 , . . . x lt ) otgto (x x + 8x l9 . . . x n + 8#J. If 7 is the shortest 
of these lines, its length is not changed 1 by this deformation 
(except for infinitesimals of higher order); hence if I is the length 
of g and 1+81 that of g', we have 

81 — 0, . , (30) 

whatever g' may be (subject only to the conditions imposed 
above), a condition analogous to the vanishing of rf/in the former 
case. Here too, however, it is to be noted that in general the 
condition (39) can be satisfied not only by the required line 
but also by other lines which do not give the shortest path from 
A to B. 

For instance, let A and B be on the same generator of a cylin- 
der. Then the shortest path is evidently given by the generator, 
which, as can easily be seen, satisfies (39). But all the infinite 
number of helices which pass from A to B, wrapping them- 
selves 1, 2, . . . times round the cylinder also satisfy the same 
equation. 

We shall call all the lines which satisfy condition (39) geodesies. 
They possess important characteristic properties, which can be 
deduced from (39); e.g. the osculating plane at any point of a 
geodesic on a surface is normal to the tangent plane to the 
surface — the property adopted on p. 103 as the definition of a 
geodesic. The lines of minimum length between two given 
points must be looked for among the geodesies through the two 
points. 

This is the definition which we shall use below; but it is to 
be noted that some writers in defining geodesies start from another 
property. We could in fact show that when a point A is fixed 
on a geodesic g, then for all points B (on g) sufficiently near A, 
g is the only geodesic joining A and JS, and is therefore the 
shortest line joining them. Hence we can also say that 
a geodesic is a line such that it forms the shortest path 

1 For a more rigorous and complete discussion the reader is referred to treatises 
on analysis. 
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between any two of its points, provided they are sufficiently close 
together. 

With this restriction the two definitions are equivalent. 
24. Differential equations of geodesies. 

We shall now examine the property, concisely expressed by 
the equation (39), that the length remains unchanged in an 
infinitesimal displacement which does not move the extremities, 
and see how to express it by means of n differential equations 
which the n functions 

x t = x t (s) (i = 1, 2, ... n) 

defining the curve g must satisfy. 
Let the equations of g' be 

x, =- x,(s) + 8x, (i = 1, 2, . . . n), 

where the Sr's are to be considered as infinitesimal functions of 
6*, vanishing for s — 0 and s = /, and having finite and con- 
tinuous first and second derivatives, but otherwise arbitrary. 

Take an infinitesimal segment PP X of g 9 of length ds; we 
have to calculate 8ds, i.e. the increment (or, as it is called, the 
variation) of ds in the deformation which displaces P to P' 
and P x to P [m If dx t (i = 1, 2, ... n) is the difference between 
the co-ordinates of P and of P l9 the corresponding difference 
after the deformation— which we shall denote by d{x t + S#,) = 
dx L + d8x n where dSx t is of course the differential of the function 
8x— is calculated as follows: 

The co-ordinates of P' are x t + Sx 4 ; 
those of P x are (x t + dx t ) + 8(x t + dx^; 
therefore the required difference is dx t + 8dx^ 

It follows that 

8dx, = rfS#„ . (40) 

a result which we shall at once make use of. 

We shall now take the expression for ds 2 , and calculate its 
variation, differentiating with the operator S. We have 

n n n 

2ds . 8ds = E ;jt 8<ij k dxj dx k -f- a Jk dx k 8d,Xj + Ej* a Jk dx } 8dx k . 
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Since the last two sums are identical, we can write this in 
the form (using equation (40) ) 

n n 

2ds . 8ds = 2^ 8a jk dxj dx k + 2JL Jk a jk dxj d8x k . 

Dividing by 2ds, and denoting differentiation with respect 
to s by a dot, this gives 

u n 

Sds = JS^ 8a jt ±j x t ds + a jk Xj d8x k ; 



and from this, since 



8a }l ^ i k p8x k , 



we get 8ds in the form 

M da n 
Sds JZ^ - - Sx /f tfs + £■„ a * i d8x k . 

i 9% i 
Now since the length of g is 

I ---= / ife, 

the variation which is to be equated to 0 in (39) is 

Si = fads, 

J A 

or 81 = f i |S* ifa + Z, . . (41) 

where we have put 

/B n 
2 jk a }k xjd8x k (42) 
A 1 

We shall leave (41) aside for a moment and examine the 
possibility of transforming the integral in (42) also into a form 
which explicitly contains the arbitrary variations Sx k . Integrat- 
ing by parts we get 

tn "IB rB n 

Z Jk a jk Xj 8x k \ — I d (a Jk #,) 8x k . 
1 Ja ' A 1 
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The integrated part vanishes, since 8x k = 0 at the extremities; 
differentiating the product, the other part gives 

/B n rB n 

Zjk *tt £ j ds8x k —\ ±j da jk 8x k . (42') 
A l J A 1 

Expanding da jk9 the sum under the second integral sign may 
be written 

^ J ki d ^ jfr x J x l 8x k ds 9 
1 ox t 



or, interchanging j and Z, 



C X,, 



1 



We shall take half the sum of these two expressions to repre 
sent the value of either. Substituting in (42'), we get 

I = Zjta* xjSx k ds - j\ (i jH + !|*] *, *, 8*,) 

We now return to (41) and insert in it this expression for I. 
Putting all the terms under a single integral sign, and talcing 
out the common factor 8x,,ds, we get 

81= - J±\-i2/"*x J z t + X J a ik Xj 

J A 1 V 1 VXk 1 



or, remembering the definition of Christoffel's symbols, 

/B n r n n \ 

2* { 2, a jk Xj + [j7, A] Xj x £ \ 8x k ds. 
a 1 w 1 J 

Putting 

n n 

= a,* a;} + 2,, [jl, k] Xj x h . . (43) 
the formula appears in the concise form 

/B n 
H k p k 8x k ds. . . . (44) 
A 1 
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The result of all these calculations is that (39) can be written 
in the form 

(*& t p t *x t d8 = 0 (39') 

J A 1 

Now since (39') must hold however the arbitrary functions 
hx k are chosen (subject to the qualitative conditions stated above), 
we must necessarily have 

p, f = 0 (k = 1, 2, . . . n); . . (45) 

for if not, we need only take each hx k with the same sign as the 
corresponding p f (which can be done without going outside the 
conditions imposed); the sum would then certainly be positive 
at all points of the arc of integration and therefore the integral 
would not vanish. This is the fundamental argument in the 
calculus of variations; by means of it we get from (39') (which 
is only (39) expanded) the n differential equations (45) which 
written at length are 

ij ajt x, + i A [jl, k] ij x t = 0 (k 1,2,... n). (45') 
i i 

It is convenient to write these equations in the form obtained 
by solving for the a;'s. To do this we introduce the quantities 

jf - S t a*j>„ .... (40) 

1 

and replace the equations (45) by the equivalent system of 
linear combinations 

p* = 0, 

or ^ + ij, {jl, i\ t.j x t = 0. . . . (47) 

These n differential equations of the second order in the n 
unknown functions x ( (s) are equivalent to equation (39) and 
are therefore the characteristic equations of a geodesic, when 
integrated, they give the parametric equations of the curve. 
By the ordinary theory of such equations, the integrals will 
contain 2n arbitrary constants, which can be determined by 
the condition that the geodesic passes through two specified 
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points, or that it starts from a given point and has a specified 
direction. 

It may be noted that the equations (47) contain only intrinsic 
elements, as the definition of a geodesic would lead us to expect. 

25. Geodesic curvature. 

The discussion in the preceding section provides us with an 
opening for the introduction of an illuminating and fertile 
geometrical notion relating to any curve x t x t (s) in 
our V n . 

We must first show that the quantities p k defined by (43), 
corresponding to a generic curve x L — - x t (s), are covariant (and 
in consequence the j/'h are contravariant), so that we shall 
naturally associate with every point of the curve I (which is 
a priori any curve whatever) the vector p of which they are the 
components. Suppose then that we pass by any transformation 
from the variables x to new variables and let p, c represent the 
values of the p,/8 calculated in the new system. We get from 
(41), through tin* in variance of Si, 

81 — / "L k p k 8x k d$, 
A 1 

and therefore 

0 ^ J p k 8% — p k 8x k J ds. 

By a similar argument to that used in passing from (39') to 
(45) we deduce from this that at every point of I we must have 

11 n 

1 1 
which expresses the invariancc of 

Z k p k 8x k 
1 

(a linear form in the arbitrary contravariant variables 8x k ) and 
therefore the covariance of the p k &. It follows from (46) that 
the reciprocal contravariant system consists of the p l9 s, i.e. of 
the left-hand side of equations (47). 

We shall use the term geodesic curvature of the curve I at 
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any point on I to denote the vector p whose covariant components 
are defined by (43), its contravariant components being in con- 
sequence defined by (46), or by 

n 

P* = Xi + 2,1 {jl 9 i}x 3 x t (i = 1, 2, . • . n). (43') 
i 

An immediate corollary is that the geodesies cure the lines whose 
geodesic curvature is everywhere zero. 

More generally we have for the length of the vector p an 
important property, pointed out by Lipka, 1 which we shall 
merely state without proof: The absolute value of the geodesic 
curvature is represented, as in ordinary space, by the ratio between 
the angle of contingence and the elementary arc, where the angle 
of contingence is defined as the angle, at one extremity of the 
arc, between the tangent at that point and the parallel to the 
tangential direction at the other extremity. 

Another important property is that the geodesic curvature is 
normal to the curve, which is equivalent to saying that 

%*Pt*t = 0, (48) 

i 

since the parameters of the tangent to the curve are the xJb. 
To prove this, take the identity 

71 

2;* <*kj v k xj = 1 
i 

(obtained by dividing (32) by ds 2 ) and differentiate it with respect 
to s. We get 

n n 

2X jk a kj x k Xj + a kj x k x } = 0, 
i i 

71 n da 

or 2Zj* a k) x k x } + S, w x k x } x x = 0, 

1 1 ox l 

and therefore, by (24'), 

n v n 

2Z jk a kj x k Xj + S jW [ jl, k] x k ±j x x + S jftJ [hi, j] x k x 5 x l = 0. 

1 "Sulla curvafcura geodetica delle linee appartenenti ad una vari^ta qualunque" 
in Rend. dtUa H. Aco. dei Lincei, Vol. XXXI (1st half-year, 1922), pp. 363-356- 
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Interchanging k and j t we see that the third sum i& the same 
as the second; hence, taking out the factor 2, we have 

n n 

2/* a kj &k £ j + Sj H [jl 9 k] X k Xj X t = 0. 

This is merely (48), with p k replaced by its value as given by 
(43); hence the assertion made above is proved. 

In ordinary space, as will at once be seen, the geodesic curva- 
ture coincides in direction with the principal normal, and in 
magnitude with the flexion or principal curvature of the curve. 

26. Extension of the notion of parallelism. Bianchi's derived 
vectors. 

We propose next to extend to a V„ the notion of parallelism 
or, more generally, of equipollence defined above for a V 2 . 

In this cane we have no criterion analogous to that used for 
the F 2 , as in general the circumscribed developable which formed 
the starting-point of the former argument does not exist. 

The differential law of parallelism, however, expressed by 
the symbolic equation (19), can be immediately adapted to the , 
case of a F n . To do so, consider a vector R drawn from a point I 
P of V„, and let R + rfR denote the equipollent vector drawn 
from a point P x of V n , very near to P. We can think of the V n , 
and therefore of the vectors R, R + rfR, as immersed in a 
Euclidean space S N , where N is a sufficiently large integer; we 
can therefore define the vectors R, R + rfR, not only by their 
(co variant or contra variant) components with respect to V n , 
but also by their components Y v , F„ + dY v (v = 1, 2, . . . N) 
with respect to a system of Cartesian co-ordinates y l9 y 2 , . . . y N 
in S N . Now consider an arbitrary infinitesimal displacement 
SP, contained in V n and drawn from P; it can be specified either 
intrinsically, by means of the Sa^'s (i = 1, 2, . . . n), or with 
reference to Cartesian co-ordinates by means of the Sy/s (v = 1, 
2, . . . N); but it is to be noted that while the first set are 
arbitrary the second are not, on account of the equations (§21) 
which define the y's as functions of the x's. We can also say, 
in geometrical language, that the displacement must satisfy the 
condition of being tangential, Le. of belonging to V v . We shall 
Refine the vector rfR, and therefore the parallel displacement 
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by means of the symbolic equation (19), which can be expanded 
(p. 107) into the form, analogous to (19'): 

S^Sy, = 0, .... (49) 

1 

which holds for all displacements satisfying the given condition. 

The only difference between formula (49) and (19'), which 
defines parallelism with respect to a surface, is that the sum- 
mation for v is from 1 to N, instead of from 1 to 3. All successive 
steps in the calculation follow automatically as in § 15, p. 107. 

We shall first write the equation in terms of the 8x'& by put- 
ting 

dR x 8P - kdY lf by v = hr fc 8x /c ; . (50) 

after transformations analogous to those formerly used, we find 
for the t's the expressions 

r k = % a ki dW + 'i fl [jl, Jc] W dx t {Jc - 1, 2, . . . w), (51) 

i* i 

an obvious generalization of formula (21"). Evidently, in view 
of the identity (50), we are here dealing with covarianl expres- 
sions (with respect to any transformations of the #'s). The 
reciprocal system 

I 

can also be expressed in the form 

r' : - dft' + 2 yZ f jl, i) R J dx t (i = 1, 2, . . . n), (51') 

i" 

in complete analogy to (21'"). 

From (49) and (50) we finally reach the intrinsic equations of 
parallelism: 

r k = 0 (k = 1, 2, . . . n); 
these are equivalent to r 4 0, or to 

dR! + 2^ { jl, i\ R j dx t = 0 {i = 1, 2, . . . n), (52) 
i 

which define the increments dR 1 of the contra variant components 
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of a vector R for a displacement parallel with respect to V n 
from P (of co-ordinates x) to P x (of co-ordinates x + dx). 

For the covariant components we find, as on p. 113, the 
equivalent equations 

dR, = \ t {il, j] Rjdxi (*' =1,2,... n). (52') 

This equation and (52) alike show that parallel displacement 
is an intrinsic operation with respect to the metric of V n . This 
was not a priori evident from the geometrical definition we 
adopted, which is expressed in formula (49), involving the use of 
a surrounding space S N . 

The equations (52) and (52') are, so to speak, identical with 
the equations (23) and (27) which hold for a V 2 , the only differ- 
ence being in the number of dimensions. It is of course under- 
stood that { jl, i} and {il, j) denote Christoffel's symbols of the 
second kind constructed with the ds 2 of V n . 

All the properties deduced from the equations of parallelism 
with respect to 11 surface can be extended without difficulty to 
parallelism in V n ; in particular, the properties that parallel dis- 
placement along any finite curve whatever is always possible, and 
in only one way, and that parallel displacement leaves unchanged 
the scalar product of two vectors, and therefore lengths and 
angles. We shall show in the following section that we can also 
extend the property of autoparallelism of geodesies, which we 
proved geometrically in the case of surfaces. 

We may also call attention here to the notion due to Bianchi 1 
of the derivative of a generic vector R along a curve T, R being 
a function of the points of I 7 . If the vectors R(s) at various 
points of T are not parallel along T, the contra variant simple 
system t\ defined by (51'), is not identically zero. Accordingly 
the quantities 

w-i-'Z+w}*^ <•->.*.•••«> 

may be considered as contravariant components of a non-zero 
vector Z)R which is also a function of the points of T. The 

1 Cf. "Sul pamlloliemo vincolato di I<avi-Civita nella metrioa degli spazi curvi", 
in Rend, della Ji. Aec, di Napoli, Vol. XXVIII, 1922, pp. 150-171. 
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vector DR has been called by Bianchi associated, and its direc- 
tion and length the direction and curvature associated point 
by point with the vector R(s). We prefer, however, the quali- 
fication derived, because in Euclidean spaces DR is precisely 
the vector commonly known as the derivative of R with 
respect to the arc s of T. In fact, if we assume Cartesian 
co-ordinates, the Christoffel's symbols vanish, and the pre- 
ceding expressions for the (ordinary) components of (DR) 1 

reduce to ~ £ . 



Returning to a general manifold V n9 if R(s) reduces to the 
versor which is tangential to the curve T 9 i.e. in particular 

dx~ 

if R l = — - — x„ we find that we are again dealing with 
as 

the vector p of geodesic curvature considered in the preceding 
section. 

It can be shown that in every case DR (if not zero) is perpen- 
dicular to R, and that it has other interesting properties demon- 
strated by Bianchi. For further details the reader is referred 
to the paper just cited in the footnote, or to the Appendix to 
Vol. II of the same writer's Leziani di geometria differenziale. 1 

27. Autoparallelism of geodesies. 

Analytically we may derive this property from the equations 
of parallelism by using the differential equations found above 
for geodesies. 

Let X denote a unit vector defined at all points of the geodesic 
under consideration and having everywhere the same direction 
as the geodesic. We shall show that X may be considered as 
undergoing a parallel displacement along the geodesic. 

Let its parameters be A'. From the definition of these para- 
meters, and using the parametric equations x t = x^s) of the 
geodesic in question, we plainly have 



ds 



and therefore 




1 Second edition. Bologna, Zanichelli, 1923. 
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Now the x 9 & and the x'a are connected by the equations 
(47) (the characteristic equations of the geodesies). Substituting 

A' and — for x t and x l9 these become 
ds 

f = ™ +Zj l {jl,i}>Jx l - 0 . . (53) 

or multiplying by ds, 

tfds = d\ { + % x {jl 9 i) Tidxt = 0, 

which are the equations expressing the parallel displacement of 
the vector X. 

It is worth noting that a comparison of (51') and (53) shows 
that the quantities p l ds are a particular case of thcr 7 's, the generic 
vector R being replaced by the unit vector X of contravariant 
components x,. There follows immediately the contra variance 
of the quantities p % 9 or, which comes to the same thing, the co- 
variance of the quantities p l9 which we proved directly in § 25. 

28. Remarks on the case of an indefinite ds 2 . 

We agreed (§ 20) to say that an n-dimensional V n is metrically 
defined when there is associated with it a differential quadratic 
form, with real coefficients a lk9 

it 

<f> — Yi iH a ik dx i dx k . 
i 

We then introduced the hypothesis that <f> is definite and posi- 
tive, and this is the only case we have considered in the fore- 
going sections. We now propose to make some remarks on the 
case in which <f> is still supposed irreducible (or such that its dis- 
criminant a is not zero), but is no longer definite, being capable 
of taking positive values for certain sets of differentials dx i and 
negative values for certain others. 

In this case also, fixing a generic point P of co-ordinates 
x h and an infinitely near point P' of co-ordinates x t + dx t9 we 
put 

71 

ds 2 = <f> = ^a^dx^dx^ . . , (54) 
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and we shall call ds 2 (which can now be positive, negative, or 
zero) the square of the line element (the distance), or better the 
interval between the two points P and P'. 

Among the oo n (real) systems of differentials dx h or, as we shall 
say, considering only ratios, among the oo' 1-1 directions drawn 
from P, there are oo n ~ 2 which satisfy the quadratic equation 

ds 2 = 0 (55) 

For a moment we shall interpret the differentials dx t as 
referring to Cartesian co-ordinates with origin P. Then the 
directions just defined, which are said to be of zero interval, 
constitute a quadric cone of vertex P. This cone divides the 
sheaf of directions drawn from P into two regions, in one of 
which 

ds 2 > 0, (56) 

and in the other 

ds* < 0 (57) 

All the directions in the first region are said to be of the first 
kind or timelike (the term being suggested by the interpretation 
given to these symbols in the theory of relativity); those in the 
second region are said to be of the second land or spacelike. The 
parameters of a direction of either kind are defined by the for- 
mula) 

A* = ° % (i = I, 2, ... n); . . (58) 

there is no analogous result for the directions of zero interval 
corresponding to which ds 2 — 0. 

For timelike directions ds 2 > 0; hence, if ds denotes the 
arithmetic value of the square root of ds 2 , we have | ds | = ds 9 
and the argument is exactly as it was for the definite quadric. 

For spacelike directions, on the contrary, we have 

n 

\ds 2 \ — — ds 2 = — a, k dx L dx k , 

i 

so that the quadratic identity satisfied by the parameters A* 
id 

£*a*A*A* - - 1, . . . . (59) 
i 
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with — 1 on the right, instead of + 1 as for the timelike direc- 
tions. 

Granted these results, the systematic extension of the pre- 
ceding sections to the indefinite case would certainly not seem 
likely to be difficult. As we are not aware that this has yet been 
done exhaustively, the reader's attention may be called to it. 
We propose merely to point out an essential fact, almost evident 
a priori and often used in the theory of relativity; namely, that 
the definitions, geometrical representations, and formulae in the 
foregoing sections can certainly be carried over and applied to 
the indefinite case, provided (a) that we take account of the 
exceptional behaviour of the directions of zero interval, and 
(b) that we make the obvious formal modifications necessitated 
by (59) when we have to deal with spacelike directions. 

We leave the matter here, 1 with two examples to conclude 
tli e disc 11 ssion : 

(1) The condition that two directions, whether timelike or 
spacelike, of parameters A', fif, may be orthogonal is in every 
case expressed by the equation 

a»«*Ay = 0. 

1 

(2) If we consider only lines wholly composed of timelike 
elements (ds 2 > 0), the discussion in § 24 holds without modi- 
fication, and we reach the same equations (47) of the geodesies. 



1 Bee Chapter XI, p. 287. 
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The Fundamental Quadratic Form and 
the Absolute Differential Calculus 

CHAPTER VI 

Covariant Differentiation; Invariants and Differential 
Parameters; Locally Geodesic Co-ordtnates 

1. Covariant differentiation. 

Returning to the remarks made on p. 8G of Chapter IV, 
we now propose to generalize the operation of differentiation by 
substituting for the ordinary derivatives of the elements of a 
tensor certain linear combinations of these derivatives and of 
the elements of the given system, which will in their turn con- 
stitute a mixed (or in particular, covariant) system with one 
index of covariance more than the given system. Explicitly, if 
AS ' ' ' f is the given generic system whose elements are functions 
of the x'b or, in geometrical terms, functions of position, we shall 
deduce from it another system A h S ' " [ h f v where I is a new index of 

dA h[ " ' * M 

covariance, which reduces to the system ~ - - — ? in the particular 
case when the co-ordinates are Cartesian. 1 

To simplify the formulas, we shall consider first a mixed system 
A h i with a single index i of covariance and a single index h of 
contravariance. 

Fixing our attention on a specific point of V n (i.e. ignoring 
the fact that the 4's are defined as functions of position), we 
know that the law of transformation of the functions A 1 * for 

144 
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a change of variables is defined by the invariance of the form 

f = ^au 1 ^, .... a) 

1 

in which the £'"'s constitute a generic contra variant system, or, 
in other words, are the contravariant components of a generic 
vector 5; similarly the can ^ e considered as the covariant 
components of a generic vector u. 

Now, since a set of values of the A 1 ?'*! is associated with 
every point of V n , we can at every point choose two arbitrary 
vectors J;> and construct an invariant form with them and 
the A 9 a. 

Suppose this choice made at an arbitrary but determined 
point P, and consider also a generic point P x infinitely near to 
P. We shall agree to take for ^ and u at P 1 the vectors parallel 
to those chosen at P\ as the displacement is infinitesimal, the 
curve of displacement is immaterial. We shall use the operator 
S to denote in general the increment of a quantity in passing 
from P to P t , and we propose to calculate 8F. Differentiating 
(1) with the operator S, we have 

1 

Now, by the convention just adopted as to the vectors % 
and 11, the differentials 8£ l and Su fl must be calculated by the 
formulae of parallelism ((52) and (52'), pp. 138, 139), while 
hA l l is given by the usual rule of differentiation 

n o Ah 

8A? = 8x lt 
1 ox L 

the-4's being by hypothesis functions of position. Using these 
results, we have 

8F = S„4^ ^u h 8x l -T, ih}l A h l {3l,i}u ll ^8x l 

1 OXi l 

+ i ihjl A h i {hhj}^u j hx l . 



Interchanging i and j in the second sum and k and j in the 
third, so as to get the factor £ l u h Sx t in all three sums,, and 

( D 666 ) * 
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collecting all the terms under a single summation sign, we have 

n r pj A h n - n -| 

8F== M^t -^4Hj}+S;^{^Mjr%8^ (2) 

Now the left-hand side of this equation is invariant on account 
of its meaning, while £\ Sx h u h are arbitrary contra variant or 
co variant systems; hence the coefficients of this form (the expres- 
sion in square brackets) constitute by definition a system which 
is covariant with respect to i and I and contra variant with respect 
to h. We can therefore put 

(A)\ = -^Af{il,j} +Z J A{{jl,h}. . (3) 

This system is called the cxyvariant derivative of the system 
A\. It is sometimes denoted by the symbol A f t \ l9 and also, when 
no ambiguity is possible, simply by A%. 

It is obvious that in Cartesian co-ordinates (which exist when 
we are dealing with Euclidean forms; cf. § 21 of Chapter V, 
p. 121) the system reduces to that of ordinary derivatives. 

The method used above can be applied, mutatis mutandis, 
to a generic mixed system. We shall always get for SF (as follows 
at once on carrying out the necessary operations) a multilinear 
form whose coefficients we shall define as elements of the co- 
variant derived system. These coefficients consist of a first 
term which is the ordinary derivative, followed by as many 
terms preceded by the minus sign as there are indices of co- 
variance of the given system, and as many terms preceded by 
the plus sign as there are indices of contra variance. If we denote 
by {i) the aggregate of indices i x . . . i m and by (It) the aggregate 
h t . . . hp, the general formula is 1 



4» = Ofi* - S r 2, A {h) ... . \i r lj) 

K_ x jh p+l . . . h tl 



(4) 



+ Z R X J A n --^^--"'{jl,h p }. 



1 Cf. A. Palatini : " Sui fondamenti del Calcolo DiflVrenzialc aasoluto ", in 
Rend, del Oircolo Mat. di Palermo, Vol. XLIII, 1919, pp. 192-202. Another 
vectorial illustration of covariant differentiation was given by the late Prof. 
Hksbknbkku in his paper "Voktorielle Begriindung der Difforentialgeometrie ", in 
Math. Ann., Vol. 78, 1917, pp. 187-217. 
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2. Particular cases. 

Consider first a covariant simple system A i9 which we can 
always interpret as consisting of the covariant components 
(moments) of a vector A. In this case the terms contributed by 
the indices of contravariance are absent, and (4) (or (3) ) gives 

r) 4 71 

A\i = ~ 1 — E, {ihj\A y ... (5) 
ox L 1 

It is easy to see that this double system is not in general 
symmetrical; from (5) however we get at once the important 
relation 

dA t dA g 

dx t dx, 



— A L \ L — ^ 1 — . . . (6) 



The vanishing of the covariant derivative A^ t has a simple 
geometrical significance. In this case, multiplying (5) by dx l9 we 
have 

dA- 11 

' d x L E, {ilJjAjdxt; 

0 Xi 1 

comparing this with equation (52') of the preceding chapter, in 
which we suppose all the dx's to vanish except the Zth, we see 
that it expresses the fact that the vector A undergoes a parallel 
displacement along the line 

Analogously, for the derivatives of a contra variant simple 
system A 1 , we have 

A\ t tiA +Z,Ai\jl,i\. . . . (5') 

Next, consider a system of order zero, i.e. an invariant/. In 
this case (4) becomes 

t'"li ; ■ 

or tJie covariant and the ordinary derivatives are identical. If we 
construct the system of covariant second derivatives, applying 
formulae (5) to (7), we shall have 

€X ii OX k 1 OXj 
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these are not the same as the ordinary second derivatives but, 
like them, are symmetrical. 

For a covariant double tensor (4) becomes 

r) A n n 

= ~* - Sj {* j} A jk - S, {hi, j) A i} ; (9) 
ox l l 1 

and for a contra variant double tensor it becomes 

2 A ik n n 

Af { + {jl, i) A* + *j {jh k} A». (9') 

OXj l 1 

3. Ricci's lemma. 

If formula (9) is applied to the system of the coefficients 
of ds 2 , we get, remembering the expression for the derivatives 
of these coefficients in terms of Ghristofiel's symbols (Chap. V, 
§ 16, p. Ill), 

a»H *= 0 k, I - 1, 2, . . . n). . (10) 

This important theorem, that tlie covariant derivatives of the 
coefficients a ik are zero, can be proved directly from the definition 
of covariant differentiation. To do so, we must choose two 
arbitrary vectors 5, *]> and construct the expression 

F = S,*a a rV; 

1 

we then calculate 8F corresponding to a parallel displacement 
of the vectors ^, and we shall get a trilincar form in f *, y k 9 8x h 
whose coefficients, by definition, will give the required derived 
system. 

Now F is merely the scalar product of the vectors J; and yj, 
which, as we know, is not changed by a parallel displacement; 
hence we shall have SF = 0 for any values whatever of 5, *)> 
and Scc's, which means that all the coefficients of this form vanish 
identically. 

Similarly we can show that the covariant derivatives of the 
reciprocals a v>( vanish; in this case we have to use the expression 



n 
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which is again the scalar product of the (arbitrary) vectors 
u and v. 

4. Contravariant differentiation. 

There is in the absolute differential calculus a kind of law 
of reciprocity or duality in accordance with which we can deduce 
from every theorem or formula a reciprocal theorem or formula, 
by interchanging the words covariant and contravariant, and 
lowering or raising the indices. We have already had several 
examples of this; we shall now make some brief remarks on the 
operation of contravariant differentiation, which corresponds to 
that of covariant differentiation just described. 

The shortest way to deduce from a system A^ the system 
A^ k which has the properties reciprocal to those of the co- 
variant derivatives, is to find the covariant derivative of the 
given system and then compound it with the system of the a kl9 s; 
i.e. to make 

A im - S «» A ih) 

We could find for this system an expression analogous to 
(4) and properties corresponding exactly to those of the covariant 
derivatives; or we could find these properties directly from those 
of the covariant derivatives, by using the foregoing formula of 
definition. We shall therefore not pursue the argument in detail, 
and shall instead resume our discussion of the fundamental 
properties of covariant differentiation. 

5. Conservation of the rules of the ordinary differential 
calculus. 

First, consider a tensor, in general mixed, which is the sum 
of two others of the same rank and species, i.e. 

Ah) _ (h) (/O 

It will at once be seen that the covariant derivative of the 
system A is obtained, like an ordinary derivative, by adding 
together that of B and that of C, or 
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This formula follows cither from the linearity of (4), or from 
the consideration that the form F relative to A is the sum of a 
form relative to B and a form relative to C, so that a similar 
result holds for 8F; the coefficients of the latter expression 
(which are by definition the derivatives A^) will therefore be 
the sums of the corresponding coefficients of the other two 
(which are by definition the derivatives l£ h \ and C^X The 
reasoning can be extended without difficulty to a sum of any 
number of terms. 

Next, consider the derivative of a product. If Z?^, are 
two generic tensors, we shall denote their product by 

M _ <m 

where the symbol (i) stands for the aggregate of the indices 
(i') and (i") together, and similarly for (k). We shall show that 

.CO jdh) {h") <h') (h") 

To simplify the formula* we shall suppose that the systems 
A and B have each only one index of covarianee and one of 
contravariance. We know (Chapter IV, § 8, p. 76) that if 

cf, = s 

are the invariant forms for the systems B and C, that for the 
system A is 

F - <f>if>. 

We shall therefore have 

8F if,8<j> + <f>8*/*, 

and equating the coefficients of £ l ' rf u h *v h » Sx t on both sides 
of this equation wc get equation (12) (for the particular case 
considered). 

Now consider the derivative of a compounded mixed system 
(Chapter IV, p. 79) 

A (i) — Mr)(*) n (0(r) °<i")<*)- ' • * 
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where (i) and (h) have the meanings already explained, and 
(r) and (s) denote the aggregate of all the indices affected by the 
process of contraction. We shall show that 

^(OZ — -^(r)(«)l ^(£0(r)« + -*V)(r) J- V 14 ) 

In particular, if each aggregate reduces to a single index 
and if the process of contraction is applied only to one index, 
(13) becomes 

A'$ - S,Z£Cf (13') 

and (14) becomes 

A% = £, Cf + B* C$']. . . (14') 

We shall give the proof for this simpler case, merely point- 
ing out that it can be immediately extended to the general 
case. 

We start from the invariant forms relative to the systems 
B and C 

1 

L 

where we have followed the same procedure as in Chapter IV, 
p. 78, and introduced a set of n contravariant systems X r a 
(a — 1, 2, . . . n) and the associated reciprocal set. The 
invariant form 

i 

has the A 9 s as coefficients, as we saw in Chapter IV. 
Applying the symbol of operation 8 to this we get 

1 

and equating the coefficients of i i rj J u h v k 8x l on both sides of 
this equation, we get (14')- 
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To sum up, we have shown that the fundamental rules of 
ordinary differentiation hold good for covariant differentiation. 

6. Applications. 

We note first of all that if we start from a generic simple 
system (a function of position), say a covariant system V i9 
and consider its reciprocal V\ we have by definition 

F £ = ± k a*V k ; 

1 

hence, taking the covariant derivative and using Ricci's lemma, 

V\ t - £ k a*V kU (15) 

We shall next calculate the covariant derivative of the scalar 
product X of two vectors, which, as we know already, is identical 
with the ordinary derivative. 

Let U, V, be two generic vectors, and put 

X = U X V = E< : Ui V\ 
i 

Taking the covariant derivative, we have 

X L « £ [17,1,7'+ cr,Ffd. 

In the second term on the right we can replace V\ t by the 
expression for it in (15), so that 

S, U, V\ t - S„o* J7 £ v k[l = i k u k v k] , 

Changing k into i, and substituting in X h we get the formula 

X t = ^.[U^V'+U'V^l . . . (16) 

which is often used. 

In particular, if V = U, we have X = U 2 , and therefore 

X t = 2U|H = 2S t E7<EV . . (16') 
ox, i 
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7. Divergence of a vector and of a double tensor. oX 
an invariant. 

Take a covariant simple system X l9 which we can always 
think of as the aggregate of the components of a vector X, and 
construct the invariant 

0 = ija-JT,,,, (17) 

where the terms X,^ denote covariant derivatives. In the 
particular case of the fundamental form being Euclidean, we 
have a 11 = 8', and also the covariant and ordinary derivatives 
are identical; hence in this case (17) becomes 

s = 

1 ox t 

In three dimensions this expression is called the divergence 
of the vector X. We shall extend the use of this term to the 
general case (17). 

We can transform (17) by means of (15). Writing X instead 
of V, (15) becomes, for / i, 

71 

X\ t — ^ k a ,L X k ^. 

Summing with respect to i, the right-hand side gives 0, 
as can be seen at once from (17) by putting I instead of k and 
then interchanging / and i. Hence we have 

e = kx\ (no 

1 

From the general rule for covariant differentiation, or more 
specifically from (5'), we have 

Now sum with respect to i. Substituting from (17') on the 
left, and from the identity 



'54 
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(cf. formula (26), Chapter V, p. 112) on the right, and writing 
I as the index of summation on the right instead of i and j, we 
get 

= )- °f a x\ 

l i \dx l s/a Sx t / 



or, taking tlie factor * outside the summation sign, 

V a 

e = - 1 S, J* WaX 1 ). . . . {IT') 
s/a i dXi 

This expression for the divergence is completely equivalent 
to the formulae (17) and (17'); it is more useful for purposes of 
calculation, while (17) and (17') on the contrary are more suited 
to theoretical discussions. 

In particular, consider the case where the vector in question 
is the gradient of an invariant u 9 i.e. where 

X t - l U (* - 1, 2, . . . n). 

cx L 

In this case the divergence is denoted by the symbol A 2 u 
and is called the second differential parameter of the function u\ 
the expression for it can be deduced at once from (17) or from 
(17"), using in the calculations the fact that 

u l = \a a . 

i dx, 

We thus get 

A 2 u i ik a ik u ik = ) £ ?- {s/au% . (18) 



both these expressions being generalizations of the ordinary 
expression for A 2 in Cartesian co-ordinates. 

Next, take a contravariant double tensor X tk . We note 
first of all that if instead the given tensor were covariant (X ih ) 
or mixed (Xf), we could always compound it with the a^'s and 
so obtain an associated tensor in which both indices are indices 
of contravariance; so that the choice of a contravariant tensor 
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does not really constitute a restriction. From this tensor, taking 
the covariant derivative and applying the process of contraction, 
we get the contra variant simple system 

Y l = (19) 

i 

which, by an obvious analogy with the former case, is called the 
divergefice of the given double tensor. If the process of contraction 
were applied to the first instead of to the second index, we 
should plainly get a contra variant system 

u 

^k | k > 
1 

in general this is distinct from the divergence Y\ coinciding with 
it only in the particular case when the given tensor X lk is sym- 
metrical. Vice versa, if X lk is the system reciprocal to X lK (the 
indices corresponding in the order written), we see at once from 
the rules in § 5 that the system 

n 

i 

is merely the covariant system reciprocal to (19). Returning 
to (19), it should be added that the expression on the right 
cannot in general be transformed (as was done for the ordinary 
divergence (37)) into an expression which is convenient for 
actual calculations. In the case of an antisymmetrical tensor 
(X l/f + X hi = 0), however, the analogy in this respect is per- 
fect. In fact, if we substitute in (19) the values of Xf k given 
by (9'), the second term on the right vanishes from the anti- 
symmetry of the X's, while the other two give 

n rs jrik n 



1 o x k ] 

From this expression, by the same method as 
to pass from (17') to (17"), we get the equation 



that just used 



s/a i 
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8. Some laws of transformation, e-systems. Vector product. 
Extension of a field. 

Consider a set of n co variant simple systems A tt ^ (where 
a is the ordinal number of the system and i the index of 
covariance) and the determinant of the set 

V = || A.,, ||. 

Changing the co-ordinates from the x's to another set of 
variables x, the systems A a{i are transformed (in accordance 
with the law of covariance) into another set of systems 
Construct the determinant of these new quantities 

V = ||A a|l ||. 

We shall show that the relation between V and V is 

V = VA (20) 

where D denotes the Jacobian determinant of the transformation, 
i.e. 



/ X l X 2 • • • X n\ 

\X± 'J>2 . . . X tl / 



which is of course not zero, it being always supposed that we are 
using a reversible transformation (§ 2, p. 3). The relation (20) 
can be verified at once if we construct the product by rows of 
the two determinants on the right, viz. 



12 



^2 1 1 ^! 



2 1 2 



12 



Mil 



dx^ dx 2 

dx v dx 1 

dx x dx 2 

d d x 0 



d x x 
dx 2 



d x^ d x 2 
dx n dx n 



dx n 



Recalling (§ 11, p. 80) that 



(21) 



SOME LAWS OP TRANSFORMATION 157 

we see at once that the elements of the product determinant are 
precisely the quantities A aJi . 

It is also useful to examine the behaviour of the discriminant 

a = || a lk || 

of the fundamental form when we change from the variables 
x to the variables x. For this purpose, we take the trans- 
formation law of the coefficients a ik (§ 12, p. 85), 

" dx. dx h 

P uttin S b Jb = ha Jk p, (22) 

1 dx k 

we can write this law in the form 

v u dx i 

This law, which is completely analogous to (21), enables us 
to conclude at once, from the example of the preceding case, 
that the relation between a and the determinant b of the quan- 
tities b jk is analogous to (20), i.e. that 

a bD (23) 

Further, as (22) is of the same type as (21), the determinant 
b will be connected with a by the relation 

6 aZ), 

which, combined with (23), gives us the required relation between 
a and a, namely, 

a - - «2) 2 (24) 

It follows from (20) and (24) that the ratio -y is an absolute 
invariant, i.e. that a 

V _ V 
n/ a \f a 

Strictly speaking, this equality holds except for sign; but if 
we agree to change the sign of the radical when a transformation 
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is made for which D is negative, it holds in sign as well as in 
numerical value. 

The remark just made leads us to define a particularly useful 
tensor whose elements can be expressed in a very simple form. 

In fact, we note that the quantity which we have just 

V a 

seen to be invariant, is merely a multilinear form in the n sets 
of variables A rt |, ; ; this is seen at once by expanding the deter- 
minant V in the usual way, as the sum of products of its elements 
n at a time, where no product contains two elements from the 
same row or column, and with the usual rule as to sign. We 
may write the result in the form 

-7- = /- S^...^ ± A 1( A 2lAs . . . A /(|t . n , . (25) 

where the symbol S denotes the sum of all the possible pro- 
ducts, subject to the conventions stated as to their structure 
and sign. Since this form is invariant, its coefficients constitute 
a contravariant system. Tf we put e' 1 - ■ ' n for the coefficient of 
the product Ajj ti A 2 |^ . . . A, 4 | tJl , we see at once that we have: 

€ iik- • *n _ 0 jf a ^ least two of the indices i x i 2 . . . i n are 
equal; 

€ tf — \ if these indices arc all different and con- 

NX a 

stitutc a permutation of even order with respect to the 
fundamental permutation 1, 2, ... n\ 

€ *\ »a . . . U — 1 ^ t j ie i TU ]i ceR are a i] different and constitute 
\/ a 

a permutation of odd order. 

Hence it follows that the system of order n whose elements 
are 0, -i, — y_, respectively according to the rules just stated, 

is contravariant; we shall call it the contravariant e-system. 

We can give an analogous definition of the covariant e-system 
by considering the determinant (reciprocal to V) 
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constructed from the reciprocal elements of the systems in 
the determinant V; these elements, as we know from Chapter 
IV, p. 74, constitute a set of n contra variant simple systems. By 
a well-known theorem, which can at once be verified, we have 

VA - 1; 

hence the quantity v^aA ( the reciprocal of -Y \ will be invariant. 
Expanding the determinant A, this can be written in the form 

where the symbol S as before denotes the sum for all per- 
mutations of the indices i. 

It follows that the system whose elements e ixii . in are zero 
if the indices i x i 2 . . . i n are not all different, and are equal to 
%J a or — V a if the indices are all different according as the per- 
mutation i x i 2 . . . i tl is of even or odd order, is covariant. 

The use of the same letter e for both is justified by the fact 
that this covariant system is the reciprocal of the former system. 
This statement can easily be verified by the reader. 1 

By means of the e-systems, when n — 1 vectors v a (a — 1, 
2, . . . n — 1) are given, we can deduce from them (by invariant 
processes) an nth vector w, which is called their vector product, 
as in three-dimensional Euclidean space it is identical with the 
ordinary vector product. If v\ and v a ^ (i — 1, 2, . . . n) denote 
the contra variant and covariant components of the n — 1 given 
vectors, the formulae 

U\- = £i t ^ . . . €i i v i 2 . . i n _i ®i V* . . . ^ n _ 1 > 

^ = Ki*.. fc _ l €*^-' ,1 - 1 «,„ l «iH...Vj|4 l . 1 

define two reciprocal systems, as can easily be verified; hence 
either separately defines the same vector, which we call w. 
When n - - 3 and the sx>ace is Euclidean the components of w 
do in fact reduce to those of the ordinary Vector product. 

1 For this and other properties of the e-systems, cf. an interesting 1 note by 
TjTT»ka : "Sui sistemi E nol ualeoln difFer»*nziale assoluto ", in Rend, della H. Acc. 
dei Lined, Vol. XXXI (first half-year, 1922), pp. 242-245. 
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In any case it follows from the preceding definition of the 
components w i (or vf) that w = 0 if the vectors v a are not all 
linearly independent, i.e. if the characteristic of the matrix com- 
posed of their components < n — 1; when they are independent, 
w =# 0 and is perpendicular to every v a . The latter property 
follows from the consideration of a generic vector product w X v a . 
Taking, say, the first group of formulae, we have 

n n . . 

which is zero from the definition of the e-system, or, in other 
words, because the sum is the expansion of a determinant with 
two rows the same. 

Lastly, we wish to introduce into the metric of a V H the 
notion of the extension of a field, i.e. to define, for a given field 
of V n , a quantity V analogous to the area of a portion of a 
surface or to the volume of a three-dimensional field. Evidently 
we have a priori a free choice as to the definition of dV, provided 
that when n -2 it reduces to the expression already given 
for the element of area (formula (17), p. 99), and that when 
n — 3, in Cartesian co-ordinates, we have dV — dxdydz\ 
further, from the geometrical meaning of the term, the extension 
V of a field must be an invariant. 1 All these conditions are 
satisfied if we assume 

dV = +/adx l . . . dx ny . . . (26) 

where s/ a denotes the arithmetical value of the radical, and 
therefore 

V = f^+ZadXi . . . dx n . 

We know in fact that on a change of co-ordinates the pro- 
duct dx x dx 2 . . . dx n must be replaced by | D | dx l dx 2 . . . dx n . 
From (24), extracting the square root, and taking the absolute 
values of both sides of the equation, we get 

| s/a | . | D | dx ± dx 2 . . . dx n — %/a dx x dx 2 . . . dx tl . 

1 A detailed study of the concept of extension and of its analytical expression has 
recently been made by O. Holder. Of. " Das Volumen in einer Riemann'scben 
Mftnnigfeltigkeit in Math. Zeitsckrift, Vol. 20 (1924), pp. 7-20. 
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But the left-hand side is sf a dx x dx z . . . dx n , which is there- 
fore invariant. 

9. Rotor of a simple tensor in three dimensions. 

We can now give a definition of the rotor (or rotation, or 
curl) of a vector X given as a function of position, which shall 
hold good both when the space considered is not Euclidean, 
and also when it is Euclidean but the co-ordinates are not Car- 
tesian. For any value of n, the generalization consists in defining 
as the rotor the covariant double system 

Pa = — 

which is obviously antisymmetrical, since p d + p lt — 0 identi- 
cally. As we saw in § 2, the p's can also be written as the differ- 

?) x ^ x 

ences of the ordinary derivatives — - — - -- ; if then we con- 

dx t ox i 

sider the X's as coefficients of a Pfaffian 

<p = Z^dxt, 
i 

it* will be seen that the //s are merely the coefficients of the 
bilinear covariant of this Pfaffian (cf. Chapter II, p. 20). 

To get the full analogy to the ordinary rotor, however, we 
should consider a space of only three dimensions. For n = 3, 
there are three different elements p d — — p u > corresponding 
to the pairs of different suffixes 23, 31, 12, pairs of equal suffixes 
giving zero values of the p's. Each of the pairs 23, 31, 12, can be 
associated with the absent suffix (1, 2, or 3 respectively), or, 
in a general formula, the index h can be associated with the 
pair of the type h + 1 , h + 2, with the convention that suffixes 
which differ by 3 are to be considered equivalent; for instance, 
if h ~ 2, h-\- 2 represents the suffix 1. It is therefore easy to 
understand how when n — 3 the rotor can be represented by 
a simple instead of a double system. If, however, we were to 
put 

Ph ~ Ph+l, 7i + 2> 

the simple system so defined would be neither covariant nor 
<jpntravariant. Instead, it will be convenient to apply the tenn 
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rotor to a vector B whose contravariant components R h are 
defined as follows (with the help of the e-systems introduced in 
the preceding section): 

R" = £ u e ha Xi\i (A = 1,2,3). 

1 

The contra variance of R h follows immediately from the prin- 
ciple of contraction. In order to see the analogy between this 
expression and the ordinary rotor, note that in the double sum 
i and I can take only the values h + 1, h + 2 (since the € corre- 
sponding to the value h would be zero); since i and I must also 
be unequal, there are two possible cases: 

i = h + 1, I = h + 2, when e hu 



i = h + 2, / = h + 1, when e' 



.hit 



1 

%/ a 
1 

\/ a 



Hence this sum will have only two terms, and R h can be 
written in the following form: 



va 



or 



the latter being convenient for actual calculations. In Cartesian 
co-ordinates a — 1, and we get the ordinary expression for the 
components of a rotor (it being supposed that x l9 x 2 , x 3 corre- 
spond in order with x, y, z). 

10. Sections of a manifold. Geodesic manifolds. 

We know that in ordinary space S 3 if we are given two direc- 
tions X, {JL starting from the same point P and defined by their 
cosines A 1 , p} (i — 1, 2, 3), every other direction % through 
P whose cosines are linear combinations of those of X and fi, 
i.e. £' = pA ? + ctal'» lies in the plane determined by X and ji. 

The coefficients p and a are of course not independent, as the 
£'s must satisfy a quadratic identity; we have in fact 

p 2 + a 2 + 2pa cosXfX = 1. 
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The directions 5 so defined are therefore simply infinite in 
number, and their aggregate is called a section. 

All this can easily be extended to a generic V n , in which 
m directions X a (a — 1,2,... m) are given. 

Take m multipliers p a , for the moment arbitrary, and con- 
sider the directions § whose parameters are 

in 

f £ = S o/0(I Ai, (27) 

1 

and consequently whose moments are 

€. = (27') 

1 

In order that these expressions may effectively represent 
parameters and moments respectively, it is necessary and suffi- 
cient that they should satisfy the relation 

i 

m ii 

that is to say, S ttja p a p ft S t A; t A^ , , = 1, 

A 

or, denoting the angle between the direction X a and X^ by a£, 

HI A 

p. pp coRafi --= 1 (28) 

i 

Now suppose that the p's are connected by this relation but 
are otherwise arbitrary. We then see that (27) (or (27') ) defines 
an aggregate of oo m-1 directions (this being the number of arbi- 
trary parameters), including in particular the m given directions; 
this aggregate is called a section. 

A section G being defined in this way by means of m of its 
directions X a , take in it any m directions X' a whatever (a = 1, 
2, . . . m). It is almost obvious that the section G' determined 
by these directions is again G itself. 

This can of course be verified algebraically. In fact, if a 
direction £ belongs to G', this is equivalent to saying that its 
parameters are linear combinations of the parameters and 
therefore also of the parameters A£; i.e. the direction £ also belongs 
to G; and vice versa. 
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We saw in Chapter V, p. 130, that a geodesic is uniquely deter- 
mined if its starting-point and direction are given. Now let us 
fix a point P in a V n , and draw from it two directions X, (x; 
these will determine a section of oo 1 directions drawn from P. 
Consider the go 1 geodesies drawn from P in all these directions: 
they constitute a surface (oo 2 points) which is called a geodesic 
surface with pole P. 

A geodesic surface is therefore determined by a point and 
two directions. 

A similar definition can be given of an m-dimensional geodesic 
manifold. Take a point in V n , and m directions drawn from it, 
which will define a section of go™" 1 directions, and construct the 
geodesic corresponding to each of these directions. Since each 
geodesic contains oo 1 points, the aggregate of all of them will 
contain oo*"" 1+1 points; i.e. it will constitute a manifold V m , which 
we shall call a geodesic manifold. 

Particularly important cases are the geodesic s urfaces (m =■ 2), 
and the geodesic hypersurfaccs (m = n — 1) determined by n — 1 
directions drawn from a point; we shall use these in the following 
section. 

11. Locally geodesic (or locally Cartesian) co-ordinates. 

In general, a system of co-ordinates in which ds 2 is repre- 
sented by a form with constant coefficients is called Cartesian. 
It is not always possible to choose co-ordinates of this kind in a 
given V n \ it is however always possible to find a system of 
co-ordinates which behave like Cartesians in the immediate vicinity 
of a point P assigned beforehand, or, more precisely, which are such 
that the derivatives of the coefficients of ds 2 (which would vanish 
identically if the co-ordinates were Cartesian) all vanish at the 
point P. Such co-ordinates are called locally geodesic, or locally 
Cartesian, co-ordinates. 

Their interest from the point of view of parallelism, or more 
generally of elementary equipollent displacement, appears plainly 
from equations (52) and (52') of Chapter V, pp. 138, 139, which 
define the increments of the contra variant and co variant com- 
ponents respectively. It follows from these equations that when 
the system of reference is geodesic at P, these increments, in 
passing to any very near point, are zero, precisely as are those 
of the ordinary Cartesian components in Euclidean spaces. 
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Now take a and in it any system of co-ordinates x; we 
propose to introduce— if this is possible— a new set of vari- 
ables 

= fi (*i> # 2 > • • • #n) (i = 1, 2, . . . n) . (29) 

such that the x's are geodesic co-ordinates at P, or in other words, 
putting a ik for the coefficients of ds 2 in the new variables, such 
that 

( dS A = 0 «, t, Z = 1, 2, . . . n), . (30) 

where the use of the suffix P denotes that after differentiation 
the x'b are to be replaced by the co-ordinates x f , of P. Remember- 
ing the definition of ChristoffeFs symbols (Chapter V 5 pp. 109, 
110), we see that (30) is equivalent to the condition that these 
symbols themselves are all zero at P, i.e. that 

{jhi\j. = 0 (j, 1, 2, . . . n). . (30') 

The following analysis shows the possibility of finding a 
set of functions / ( to define a transformation of this kind. 

The condition (30) consists of n*\n(n + 1) equations con- 
taining the first and second derivatives of the /'s (since a, :/( , by 
the law of covariance, can be expressed in terms of the a /A .'s 
and the first derivatives of the/'s). Now the number of first 
derivatives is n 2 , and that of second derivatives is n*ln(n-{- 1), 
so that the number of both together is greater than the number 
of equations. Since, as we shall see, the equations are not 
algebraically inconsistent, it follows that we can solve the 
equations (30) for the values at P of the first and second 
derivatives of the /'a, or rather for some of them, the others 
remaining arbitrary; further, the behaviour of the functions 
at points other than P is a matter of indifference. Thus 
the choice of the /'a can be made with a wide degree of 
arbitrariness. 

To avoid, however, the direct discussion of the equations (30), 
we shall start from the ideas contained in § 26 of Chapter V, 
p. 138* We saw then? that the expressions 



166 ABSOLUTE DIFFERENTIAL CALCULUS 



constitute a contravariant simple tensor, the vector § and there- 
fore its contravariant components g\ and also the differentials 
dx h being all arbitrary. This holds in particular for the 
hypothesis = dx b i.e. when we suppose 

r< - d 2 x t + % t \ jl, i \ dx J dx £ . . . . (31) 
i 

If on changing the variables we have at a special point P 

I* 1 = K (32) 

then at that point, from the law of contravariance 



S. 



i dx k ' 
it follows that 

t 1 = t' (S3) 

If we suppose (as we are always free to do^ by making a 
preliminary change of variables from x t to x, + a constant) 
that the x/h vanish at P, the equations (32) are satisfied pro- 
vided the formulas of transformation (29) are of the type 

*. = x i + <j>*( x i> • - • x n)> . - (29') 

where <f>, denotes a function of the sc's which is regular at P, 
and whose expansion in a series of powers of the #'s begins with 
terms of at least the second degree, e.g. a polynomial of the 
second degree in the ;r's. In fact, if these conditions are ful- 
filled, all the first derivatives of the <£'s vanish at P. The second 
3 2 <£■ 

derivatives ^ are identical with the second derivatives 
a2 . dx f dx t 

— - - , and give the terms of the second degree (by Maclaurin's 

dx ) dx l 

theorem) on the right-hand side of the equations (29'). By a 
suitable choice of the numerical values of these second derivatives 
at P, we can make all the Christoffers symbols for the variables 
x vanish, so satisfying the equations (30'), as we shall now show. 

In fact, writing out both sides of equation (33) in full by 
means of (31), and considering the x'&, in virtue of (29'), as 
independent variables (with their second differentials zero) and 
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the x's as functions of them, wo can write (33) in the form 

u n 

d 2 x t + [hk, i\dx h dx k = E,, {jl, i)dXj dx L . 

Equating the coefficients of dXj dx t on both sides and remem- 
bering (32) we get 

~ {A<( - : • • (s4) 

from which it appears that we need only take 

3 2 x 

n n ' = (i> * = i> 2, . . . w) 

at P in order to have 

{ji, i\ r = 0 

for every possible set of values of j, I, i. Q.E.D. 1 

1 Prof. Fermi hat* recently established an important extension of this result by 
showing that, given any curve whatever, it is also possible to choose co-ordinates 
which are locally geodesic at every point of the curve. Of. his notes "Supra 
i fonomeni che avvongono in prossimita di una linea oraria" in Itend. delta Ji. Arc. 
dci Lined, Vol. XXXI (first half-year, 1922), pp. 21-23, 51 -J>2. Fermi's result 
can be quickly justified as follows, by calculating the number of available unknowns 
and of conditions to be satisfied. 

Take the equations of the curve L in the form 

#i — Xi^n) (i = 1,2,... n - 1), 

as we may always do by considering a suitably limited segment. Note first that 
if the values zl of a generic function z{x if x~& . . . x H ) and of its partial derivatives 
with respect to x lr x*, . . . js w _j are known at all points of the curve, then the values 

of also are determined at all points of the curve. This is obvious if we take 
dx n 

the identity .r,, . . . x n ) = zz,{x n ) which holds at all points of Z, and dif- 
ferentiate it, so getting 

dz _ dz L _ M - 1 dz rfaci 
dx n dx n A * dx x dx n ' 

Now suppose that we make a change of variables of the general type (29), and 
that we wish to determine, if possible, the n functions f t {x„ x>, . . . x n ) so as to 

make every — 0 along L. As has already been noted in dealing with a single 
dxi 

point we thus get n» £n(n+l) conditions involving the first and second deriva- 
tives of the /'a. Now the number of first derivatives j^f* is ri 2 f and that of the 

a a /i dx * 

second derivatives ^ is «-Xn(n + l); but from the preceding remark, the » 2 

ox h d.vk " d 2 f- 
of the latter, which are of the type - ---^ — (t, k = 1, 2, ... n), can be expressed 

ox n dx k 
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It is not inapposite to give a geometrical interpretation of 
the conditions imposed on the co-ordinates x in order that (30') 
may be satisfied, or, in other words, in order that they may be 
geodesic at P. These conditions may be put in the following 
form: 

(a) The n co-ordinate hypersurfaces passing through P must 
behave as geodesic hypersurfaces with respect to points infinitely 
near P (or, in particular, must be geodesic everywhere). 

(6) If through a point P', infinitely near to P and on one of 
the n co-ordinate lines through P— say that along which x t 
alone varies— wc construct the direction parallel to another of 
the co-Ordinate lines, this parallel must belong to the co-ordinate 
surface x t > = constant which passes through P'. 

(c) When the co-ordinate hypersurfaces are fixed in accordance 
with the foregoing conditions (which, as is geometrically obvious, 
is always possible), the numbering of these surfaces (i.e. the way 
in which they are associated with the values of the parameters 
Xi, x 2t . . - x n ) must be carried out so as to satisfy certain numerical 
conditions which we shall subsequently specify, and which, as 
we shall see, can always be satisfied. 

That (a) and (6) are consequences of (30') follows immediately 
from the equations of parallelism and of geodesies. Recipro- 
cally, we shall show that a system of co-ordinates which satisfies 
the conditions (a), (6), (c) is geodesic at P. 

We shall begin by expressing the condition (a) analytically. 
Take a direction with parameters dx k , drawn from P and lying 
in the hypersurface x £ - - constant (so that dx t -~ 0). We have 
to express the fact that the geodesic in this direction behaves at 

at points of L in terms of the others and of the first derivatives. There remain 
altogether, including both first and second derivatives, -+- (n-Jn(n + l) — 
= n-£n(n + 1) unknown functions of se w to determine by means of the same 

number of equations = 0. These last equations, as can at once be seen, 

contain the second derivatives + — (^ h < n) in finite terms (in fact linearly), 
while the unknown values of the first derivatives appear together with the 

terms -J- In any case we have a system of as many equations as there are 

dx n OXie 

unknown functions of x n alone to determine. When the values of these deriva- 
tives are known on />, we can determine, with a wide d«gree of arbitrariness, func- 
tions ft which admit of these values. This can be seen by taking a Taylor expan- 
sion of the /'s as a function of the n - 1 arguments x^ - xj, x s — acj, . . . x n ^ — x„^ v 
where the quantities acj (i ±= 1, 2, . . . n — 1) are the values of the x»*» on L, i.e. of 
the functions Xi(se n ) which define this curve. 
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P as if it lay on this hypersurface, i.e. that d 2 x ( vanishes along 
this geodesic. It follows that dx £ = d 2 x t — 0, and therefore, 
from the equation of geodesies, 

ty{jl 9 i} 1 .d£jdx l = 0. 

Of the terms in this sum, those in which either j 9 or Z, or 
both, are equal to i vanish, since dx ; — 0; the other dx'a being 
arbitrary, the necessary condition for the vanishing of the other 
terms is that 

{K~t} P - 0 + %). 

We thus see the analytical meaning of condition (a). 

Next consider (6). We shall take P' on the line i, so that, if 
dx represents the increments of the co-ordinates from P to P\ 
we shall have dx t = 0 for every value of / other than L Let 
X denote the direction of the co-ordinate line j at P, bo that 
X k — 0 for every Jc other than j, and let X undergo a parallel 
displacement from P to P'. Applying the usual formula and 
remembering that dx- t and A' are the only components which are 
not zero, we get 

d\ j — — [ji, i} r X J dx t . 

In order that the direction A' = A + dX may lie on the 
hypersurface x t - constant, we must have A" ~~ 0, or (since, 
as we noted, A' = 0 if i 4^ j) dX' — 0 if i #= j, so that w T e must 
have 

(ji~i} P - 0 (t 4=i). 

This is the analytical expression of condition (6). We must 
now use the third condition in order to show that the symbols 
with three equal indices vanish; we shall thus have exhausted 
all the types of Christoffel's symbols. 

Suppose that the co-ordinates x satisfy the foregoing condi- 
tions. Apply a transformation which leaves the co-ordinate 
surfaces unchanged; this can be done by putting x f 
(i.e. every a? is a function of a single x), or, which is the samQ 
thing, 

dx { = Xi{x t )dx l7 
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where Xj denotes the derivative of / with respect to its 
argument. 

We shall now calculate the explicit expression of the symbols 
which we intend shall vanish. We have 

n 

[ii, i] == 2; j], 

or, remembering that all the symbols are already zero except 
those with three equal indices, 

[ii, i] ^= a u {ii, i}. 

Substituting on the left-hand side the expression which defines 
the symbol of the first kind, we get 

, dd tl _ / .-— — ^ 
\ d . : - «.,{»,.}. 

Hence the condition 

ij 0 (i — 1, 2, . . . n) 

is equivalent to 

(da 



(i — 1, 2, . . . n). 
Now from the law of covariance we have 

- yi () 'Xj ^ Xf, ^ Y Y 

x dx L dx k 

and therefore 

ox, ox, 

In order that the required condition may be satisfied, the 
functions X therefore need only satisfy, at P, the n numerical 
conditions 

^"X? + 2a ii X i X' t = 0; 
ox x 

otherwise they may be completely arbitrary. 

We thus see how to determine a system of co-ordinates x 
which shall be locally geodesic at P. 
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12. Seven's theorem. 

The possibility of choosing co-ordinates which are locally 
Cartesian at a given point enables us to simplify the proof of 
some geometrical properties which hold in the neighbourhood of 
a point. As an example we shall prove, without any calculation, 
a remarkable theorem due to Professor Severi. 1 

In a given V n consider two infinitely near points, P and P v 
and a direction u drawn from P. This direction, and the direc- 
tion PP X determine a section of V n , and therefore a geodesic 
surface V 2 which passes through P and P x and contains u. 

We can now give u a parallel displacement, from P to P v 
in two ways: 

(1) by considering u as a direction in V n , and therefore using 
the metric of this variety; this will give a direction Uj, which 
we shall call the awbiental parallel; 

(2) by considering u as a surface direction, belonging to the 
geodesic surface V 2 just defined, and using the metric of F 2 ; 
this will give a direction Uj. 

Severi's theorem is that vl x and uj are identical. 

We shall examine first the case in which V n is Euclidean. 
In this case the geodesies arc straight lines (since, with a system 
of Cartesian co-ordinates y, Christoffel's symbols are zero and 
the equations of the geodesies become d*y l — 0 (i 1,2, ... n)) 
and the geodesic surfaces are planes; Severi's theorem becomes 
an immediate consequence of the ordinary theory of parallelism 
in Euclidean spaces. 

Next, if V n is not Euclidean, we note that in the definitions 
of the arnbiental parallel u 1? the geodesic surface F 2 , and the 
parallel relative to V 2 , the only metrical elements used are 
Christofiers symbols for the V n \ since all these can be made 
to vanish by a suitable choice of co-ordinates, the two methods 
of displacement are applied exactly as if V n w T ere Euclidean, and 
therefore lead to the same result. 

1 "Sulla curvatura delle superficie o varieta" in Rend, del Oircolo Mat, di 
Palermo, Vol. XLII, 1917, pp. 227 2f»9. Cf. especially § 11. 
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CHAPTER VII 

Riemann's Symbols and Properties relating to Curvature; 
Ricci's and Einstein's Symbols; Geodesic Deviation 

1. Cyclic displacement and the relations between parallelism 
and curvature. 

Schouten, 1 by his vector methods, and independently of him 
P&fes, 2 by ordinary calculus methods, have demonstrated the 
great importance, for determining the geometrical properties of 
a V n , of the displacement of a direction round a closed circuit; 
in particular the importance of infinitesimal circuits in investi- 
gating local properties at a generic point P. 

Consider a generic direction (a unit vector) u drawn from P, 
and give it a parallel displacement round a closed curve I 1 of 
infinitesimal length so that it comes back again to P\ after the 
displacement we shall have a direction u x , also drawn from P, 
but not in general coinciding with u. The change in the contra- 
variant components u 7 due to the displacement round the circuit 
will in general depend on the area of the circuit, on its configura- 
tion (i.e. on the orientation in Y lt of the element of surface on 
which the circuit is drawn), and on the metrical properties of the 
V H at P. The influence of the last-named properties is exerted 
through the first and second derivatives of the a. k 's\ those 
derivatives occur in certain characteristic groups which are 
called Riemann's symbols, and which are composed of Christoffel's 
symbols and their first derivatives. In the particular case of a 
surface, these expressions reduce to one, which is that known 
in geometry as the (Gaussian) curvature of the surface; for any 
V n the consideration of Riemann's symbols provides a convenient 
way of extending the notion of curvature. 

In this chapter we shall first consider displacement round a 
particular form of infinitesimal circuit, namely, an elementary 
parallelogram. We shall then discuss some of the properties of 
Riemann's symbols, which occur in the investigation of the 

144 Die direkte Analysis zur neueren Kelativitatstheorie in Verh. dcr Kon, 
Ah. van Wet. te Amsterdam, Deel 12, No. 6, 1919. Cf. also the same writer's Dcr 
Ricci-Kalkill (Berlin, Springer, 1924), II, §§ 12-16. 

3 44 Le parallolisme de M. Levi-Civita et la courbure rieraannienne in Rend, 
delta R. Ace. dei Lined, (6), Vol. XXVII (first half-year, 1919), pp. 426-428. 
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displacement, and shall use these properties to obtain the formula 
for changing the order of two successive covariant differentiations, 
by determining the difference between the derivatives. Lastly, 
we shall return to the question of displacement round any circuit 
whatever, and shall deduce from it the notion of curvature, 
first for a surface, then for any V n whatever. 

2. Cyclic displacement round an elementary parallelogram. 

Let two elementary vectors, 8P } 8'P, be drawn from a generic 
point P of a V n . We shall interpret the first as an elementary 
displacement PP 1? and give the vector 8'P a parallel displace- 
ment along it; let Q be the position of the extremity of 8'P after 
this displacement. If we apply the same process to 5P, and give 
it a parallel displacement along the path PP 2 , we reach the same 
point Q (as we have already shown in Chapter V. p. 110), even if 
we retain terms of the type 8S'P, 8'SP, while neglecting terms 
of the second order in SP and 8'P. We can therefore, in any V n9 
consider an elementary parallelogram PP t QP 2 . 

We shall adopt the obvious convention of representing by 
8q the change in any quantity q (scalar or vector) in passing 
from P to P t and by 8'q the analogous change in passing from 
P to P 2 . For a vector, we shall calculate these changes by the 
formulae of parallelism. 

Now let D x q be the change in q on passing from P to Q 
along the path PP t Q, and D.,q the analogous change on passing 
along the other pair of sides PP 2 Q which with the first pair 
make up the circuit. 

It will be seen at once that (neglecting second order terms as 
explained above) the total change Aq on going round the entire 
circuit in the sense PP 1 QP 2 P is D L q — D 2 q. We shall first 
examine 

The change denoted by S corresponds to the displacement 
along PP 5 ; hence, if the value of our quantity was q at P, its 

value at P, will be , ~ 

q + Sy ----- q v 

The displacement along P^Q changes q x into q x + 8'q v so 
that at Q we shall have the quantity 

q + 8q + 8'(q + 8rj) 
= q + 8q + 8'q + 8'8q 
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D x q = 8q + 8'q + S'Sy. 

As 7) 2 y, by its definition, differs from D x q only by interchang- 
ing P x and P 2 , and therefore S and 8', we get 

D/l = S' ? + 8? + SS' ? . 

It follows that the change caused by the displacement round 

the circuit is A 

Aq = (S 8 — - 88 )y (1) 

We must now find an explicit form for this expression, sup- 
posing that the quantity q is a vector u, and calculating the 
increments 8 and 8' by the formulas of parallelism. By these 
formula), the changes 8u r of the contravariant components will 
be given by the Pfaffian (p. 138, equation (52) ) 

8m' - -£, h {ih,r}u'8x„, ... (2) 
i 

while the changes 8V will be given by the same Pfaffian relative 
to the increments 8'x h . From (1) we see that we have to calculate 
the bilinear co variant (cf. p. 20, § 4) of this Pfaffian. 
Differentiating (2) with the symbol 8' we get 

8'8u r = - 8' [ih. r) u l 8x h - Z fh [ih, r) 8V 8x h 
i i 

-i th {ih,r}u' 8'8x h . 

To expand the first sum, we note that the expressions [ih, r) 
are functions of the cr's, and therefore 

h'{ih, r) = S x ^--{i^rjS'x,. 
i ox k 

The second, on substituting for SV the expression analogous 
to (2), becomes 

?* 

S lAW { ih, r\{kl, i) u l Vx k 8x h , 
i 

or, interchanging i and J in order to get the factor u l here too, 
X ihkl {lh, r){ki,l)u l 8'x k 8x h . 
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We have, therefore, taking out the factors u? 8x h 8'x k in the first 
two sums, 

— 2 <A {ih, r} u?8'&x h . 

The expression for 8SV can be obtained from this by inter- 
changing 8 and 8'; in the first sum we shall also interchange 
h and k, giving 

S8V = - £ tM T- S {ik, r} ~ i t {Ih r] {hi, l)~]u> 8'x„ 8x h 

— { ih, r) u* S8'x h . 
i 

In taking the difference 8'S?^ - 88 V the third sum cancels 
out, because 88'# /} - 8'Sx,, (cf. pp. 18, 19, § 4), and there remain 
the terms involving the indices i, h, k, in which u* 8x h 8'x /c can be 
taken out as a common factor. If we introduce lliemann's symbols 
of the second kind, 

{ir, hk) ----- d {ih, r) - £-{ih r) I 

f)X * dX,t \{i,r,h,k=\,2,...n), (3) 

-S I [{» i r}{t*,l)-{K,r){tA f i}] J 

we shall therefore have 

Au r - (8'8-88> r = —Jiajt{ir 9 hk}u i bx h b 9 x k . . (4) 

i 

This formula shows that the required increment Au depends 
on the vector u, on the two vectors 8P, 8'P which define the 
parallelogram, and lastly, on the metric of the manifold, through 
the quantities {ir, hk) . From (4) it follows as a particular case 
that for Euclidean spaces Riemann's symbols as just defined are 
all zero, whatever may be the co-ordinates x chosen for reference. 
In fact, for such a manifold, we have Au r = 0 (r 1, 2, . . . n), 
since any vector resumes its original value after parallel dis- 
placement round any closed circuit whatever. Hence; the right- 
hand side of (4) vanishes for every r, and for any value of the 
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vector u and of the displacements SP, S'P, i.e. for any values of 
the arguments u', 8x h , 8'x fr The coefficients [ir, hk) must there- 
fore vanish separately. 

It will be useful to point out at once the following two pro- 
perties of the operator A: 

(a) When applied to a product, it behaves like a symbol of 
ordinary differentiation, i.e. 

A(^) - ipAcf> + <£A0, 

which can be verified directly, by calculating first 8(ift<f>), then 
8'8(ip<f>), &c; 

(6) when applied to a function of position, the result is zero, 
as is obvious from the meaning of the symbol. 

If instead of the increments Au } we wish to find those of the 
co variant components, we can use the relation 

n 

1 

and therefore, from the properties of the operator A, 

AUj ----- ^L r a Jt ,Au r 
i 

n 

— — S r/M a jr {ir, hk] u L 8x h 8'x k . 
If we introduce Riemann's symbols of the first kind, 

n 

(ij, ftk) Yi r a jr {ir, hk}, .... (5) 
i 

we can sum with respect to r. and can then write 

Au, = - —Z^i'ij, hfyu* 8x h S'x k , . . (4') 
i 

which is analogous to (4). 

Solving (5) we get Ricmann's symbols ot the second kind in 
terms of those of the first kind by the formula (the inverse of 

(5)) 

{ir, hk} = %a> r (ij, hk.) . . . (5') 
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3. Fundamental properties of Riemann's symbols of the 
second kind. 

As can be seen from their definition (3), Riemann's symbols 
of the second kind are functions of position, depending on the 
coeificients a £k , their first derivatives (contained in Christofiel's 
symbols) and second derivatives (contained in the derivatives of 
Christoffel's symbols). They have the following fundamental 
properties: 

(a) They are antisymmetrical in the last two indices, i.e. 

{ir 9 hk} = — \ir,kh) 9 (6) 

whence in particular 

{ir, hit) — 0. 

This property follows immediately from (3). 

(b) They constitute a mixed tensor, 1 contra variant with respect 
to the second index and co variant with respect to the other three, 
so that the symbol {ir, hk ) could also be denoted (as is sometimes 
done) by a r Ulfr To prove this, consider the invariant 

F = S,j» r « r , 

where the p's are given (but completely arbitrary) functions of 
position, so that Ap r - 0. If we give F a displacement round 
an infinitesimal circuit, we find (remembering the behaviour of 
the operator A) 

AF - X r (u r Ap r + p,Au r ) 
1 

91 

= ^ r p r Au r . 
1 

As F is invariant, this quantity must also be so; replacing 
Au r in it by its expression (4), we get the quadrilinear form 

= -2**, {ir, hk } p r u* hx h h'x k , . . (7) 

which expresses the required property of the Riemann's symbols, 
since the simple systems p r , u\ 8x h , h'x k are all arbitrary. 

3 Very generally, especially in works on the Theory of Relativity, called the 
Rkmomn-Christoffel tensor, 
( D 666 ) 
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We can use the tensor character of Riemann's symbols to 
obtain a second proof of the fact that Riemann's symbols are 
all zero for a Euclidean V n (the first proof is an immediate con- 
sequence of (4), as was shown in the preceding section). In fact, 
the definition (3) shows at once that they vanish in Cartesian 
co-ordinates, and in consequence they vanish in any other system 
of co-ordinates. 

(c) They have an important cyclic property with respect to 
the three indices of covariance, namely, 

{irjik} + {hr,H} + {Jcr, ih\ = 0. . . (8) 

To prove this, we again take F and formula (7), but we suppose 
that in them the p's are derivatives of an invariant function / 
of position (whose numerical values are otherwise arbitrary), 
and we also take as vector u an infinitesimal displacement with 
components u r — dx r} which is also arbitrary. With this choice 
F becomes 

F - i r d / dx t . = df, 
1 ox r 

and (7) becomes 

(8'S - 88') df - -S^ r {ir, hk} p r dx t Sx h 8'x k . . (9) 
i 

Interchanging cyclically the three infinitesimal vectors denoted 
by the operators d, 8, 8', we get the other two formulae: 

(d8 f - 8'd) 8/ = -2 ihkr {ir, hk \ p r 8x t 8'x h dx, c , (9') 
i 

{8d — d8) 8/ — {ir, hk } p r 8'x { dx h 8x Ir (9") 

Now on the right-hand side of these last two formulas we can 
arrange to have the product dx t 8x fl 8'x k in the general term, 
merely by a suitable interchange of the indices of summation. 
We can then add (9), (9'), and (9"); the left-hand side gives 0, 
since the terms cancel out in pairs (e.g. 8'8df— S'tfS/ 8'(8df 
_ d8f) = 0, since / is a function of position); and we get 

n 

0 = *L ihkr [{ir, hk} + {hr, ki \ + {kr, ih}] p r dx t 8x h 8'x k * 
i 

As p n dx i9 8x h , 8'x k are arbitrary, (8) follows at once. 
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4. Fundamental properties and number of Riemann's symbols 
ol the first kind. 

Riemann's symbols of the first kind, as defined by (5) (i.e. 
the quantities obtained by compounding the quadruple system 
of the symbols of the second kind with the system of coefficients 
a ik ) have the following properties: 

(a) They are covariant with respect to all four indices, so that 
they may be denoted, as is often done, by a ihhk \ this follows 
from the definition, in consequence of the law of contraction. 

The remark that a Euclidean V n has all its Riemann's symbols 
zero is true here too, whatever may be the system of reference. 

(6) They are antisymmetrical with respect to each pair of indices, 
so that we have identically 

(ij, hk) = -(ij, kh) 9 .... (10) 

{ij, hk) - -0"t, hk) (11) 

The identity (10) follows at once from (5), and from the 
analogous property of the symbols of the second kind. To prove 
(1J) we shall follow a method analogous to that used in § 3 (b), 
taking as invariant the scalar product of two arbitrary vectors 

F - VjV,u?. 
i 

Applying the operator A and remembering that in a parallel 
displacement the scalar product does not change (so that A F — 0) 
we get 

0 =r.- E t v,£iuf + ^jU j Av r . . . (12) 
i" i 

The expression for Au J is given by (4), by writings in it instead 
of r; that for Avj is given by (4'). Substituting, we get 

0 - % M vj { ij, hk} u { 8x h 8'x k + % ihk u> (ij, hk) v l Sx h 8'*,, (12') 
i i 

In the first sum we express Vj in terms of the contravariant 
components v r , and then, remembering (5), we sum with respect 
to j; we get successively 

n n 

£j»*r a Jr v r {ij, hk } u? 8x h 8'x k = S^ r v r (ir, hk) 8x h 8%; 
lastly, changing the indices i and r into j and i respectively (to 
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get the term in the same form as the second part of (12') ), we 

get „ 

E>» v* (ji, hk) u* 8x h 8'x k . 
i 

We can now return to (12'), and taking out the common 
factor u J 8x h 8'x k wc get 

0 = % M [(ji, hk) + (ij, hk)] v< u-> 8x k 8'x k ; (13) 

1 

from this, since u\ v\ 8x h9 8'x k arc arbitrary, we get formula 

(c) Riemann's symbols of the first kind have also a cyclic 
property analogous to that of the symbols of the second kind 
and immediately deducible from the latter, namely, 

(ij, hk) + (hj, hi) + (kj, ih) - 0, . . (14) 

whore the second index remains fixed and the other three are 
permuted cyclically. Thi.5 formula follows directly from (8), 
on multiplying by a jr and summing with respect to r. 

As each of the terms in this sum is antisymmetrical, we can 
at once obtain from (14) a similar identity 

<*/, hk) + (ih, kj) + (ihjh) = 0, . . (14') 

in which the first index remains fixed and the other three are 
permuted cyclically. 

(d) Lastly, for the symbols of the first kind, there is a pro- 
perty of per mutability, which is a consequence of the foregoing 
properties, and according to which we can interchange the two 
pairs of indices without altering the \ alue of the symbol; namely, 

(ij, hk) - (hk, ij) (15) 

To prove this, take (14') and the three other identities obtained 
from it by cyclic permutation of the four indices in the order 
i,j, h, k, or 

(ij,hk) + (ih, kj) + (ik,jh) = 0, 
( jh, ki) + (jk, ih) + 07, hk) = 0, 
(hk,ij) + (hi,jk) + (hj, ki) = 0, 
(hi, jh) + (kj, hi) + (kh, ij) = 0. 
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Adding the first and fourth and subtracting the second and 
third of these identities, and using the property of antisymmetry, 
we see that the terms cancel out in pairs, except the four under- 
lined, which give 

2(ij , hk) — 2(hk, ij) = 0, 

whence the required property follows. 

We shall now calculate the number of independent Riemanri's 
symbols of the first kind. A quadruple system (cf. § 2, p. 65) 
has in general elements, if there are n independent variables* 
The number of distinct Riemann's symbols of each kind is 
smaller, however, as these symbols are connected by the identities 
we have just proved. We shall determine this number for the 
symbols of the first kind, dividing them into three classes, and 
counting separately those in each class, as follows: 

(1) Symbols with only two different indices: these are of the 
type (ij, ij), since the other possible arrangements are either 
reducible to this, or give zero values. Each pair of unequal 
indices i, j therefore gives a single symbol of this class, which 
thus contains 

n(n — 1) 
2 ; 

(2) Symbols with three different indices: these are of the type 
(ij, ih), since here too the other possible arrangements are reducible 
to this or give zero values. Every triplet of unequal indices will 
give three symbols of this type (since the repeated index may be 

any one of the three); since there are n ^ n ^ ^l}^ — triplets, 

the number of distinct symbols of the class we are considering 
amounts to 

n(n — l)(n — 2) 

2 " ; 

(3) Symbols with four different indices: a set of four different 
indices i, j 9 h, k will give the three symbols 

(ij, hk), (ih, kj), (ik, jh), 

while every other possible arrangement gives a symbol reducible 
to one of these. But these three are not independent, on account 
of the cyclic relation (14'). It follows that each of the 
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~- — - « ■„ . { se * s of four indices gives two distinct 

1 • 2 • 3 • 4 

symbols, so that the total number of these is 

n(n — l)(n — 2)(n — 3) 
12 

Adding these three partial totals, and simplifying, we get 
the total number N of independent Biemann's symbols of the 
first kind: 

_ »y - i) 

" 12 

Thus for an ordinary surface (n = 2) we have N = 1; 
for a three-dimensional space, 2V = 6; for a four-dimensional 
space, N == 20. 

5. Bianchi's identities. 1 

Bianchi's identities are cyclic relations between the covariant 
derivatives of Riemann's symbols of both the first and second 
kinds. They are obtained as follows: 

Take formula (3), which defines the symbol of the second 
kind \ ir, hk \ , and differentiate it with respect to x t . We note, 
however, that on differentiation the last part, which consists of 
terms of the second order in Christoffcl's symbols, gives terms 
made up of the product of one such symbol by the derivative of 
another: the essential point for us is that, with reference to a 
specified point P, by choosing co-ordinates which are geodesic 
at that point, we can make all these terms vanish. We cannot, 
however, treat the first part in the same way, as the geodesic 
co-ordinates make ChristoflEeFs symbols vanish but not their 
derivatives. We shall therefore have the formula, valid at the 
point P for co-ordinates geodesic at P, 

~ iir 9 hk} = - — —{ih 9 r\ — -{ik, r\. 
cxi dx k 0Xi dx h dx t 

1 These identities were Htated without proof by Padova, on the strength of 
a verbal communication of Ricci (cf. ** ftulle deformazioni infinitesime *\ in Rend, 
detla It Aoc. dei Litwei, (4), Vol. V (first half-year, 1889), p. 176). They were then 
forgotten even by Ricci himself. Bianchi rediscovered them and published a 
proof obtained by direct calculation in 1902 (Ibid., (5), Vol. XI (first half-year, 
1902), pp. 3-7). 
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Write down also the two other formulae obtained from this 
by cyclic permutation of the indices h, k, I, leaving i and r fixed; 

A{ir, H} = ,-^-{ik 9 r) - - J* {il, r}, 
dx h K dx t dx h y J dx k dx h K ' 

• d - {ir, lh} — -* {il, r) — 5-^-- {ih, r). 
ox k } dx h dx k J dx L dx tC y 

Adding the terms on the left and right of these three equations 
we get 

hk) + - ? [ir, hi} + 8 -{ir, lh] =-- 0, (16) 

C7 X^ O Xj t C) X k 

which holds at the point P, in the particular system of co-ordinates 
chosen. Now consider the following mixed tensor of rank five 

Ami ~~~ ~ l ir > hfyt + { ir > kl )h + lh} k , 

in which the suffixes outside the brackets denote covariant differ- 
entiation. This system, referred to the point P and to co- 
ordinates geodesic at P, is identical with the left-hand side of 
(16), since in these conditions the covariant and ordinary 
derivatives are identical; all its elements are therefore equal to 
zero, and it will therefore be identically zero whatever may be 
the system of reference (cf. Chapter IV, p. 84). We have thus 
proved the identity 

{ir, hk) l + {ir, kl} h + {ir, lh) k 0, . (17) 

which is a first form of the result established by Bianchi. 

For the symbols of the first kind the analogous relation is 
easily proved from the definition of these symbols given by (5). 
In fact, taking the covariant derivative of this formula, and 
using Ricci's lemma, we find that 

n 

(ij, hk) t = T, r a jr [ir, hk} t . 
i 

From this, permuting cyclically the indices h, k, I and Bum- 
ming, we get, by (17), 

{ij, hk) t + (ij, kl) h + (ij, lh) k = 0, . . (17') 



which is the second form of Bianchi's identity. 
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6. Commutation rale for the second covariant derivatives. 

An important application of Riemann's symbols occurs in 
the formula which gives the relation between the two systems 

obtained by double covariant differentiation from a generic 
tensor where (i) Rtands for the aggregate of indices of 

covariance i x . . . i m and (j) for the aggregate of indices of contra- 
variance j x . . . To simplify the formulae we shall consider a 
mixed double system A J i9 with the remark that the procedure 
is similar if there are more indices. 

We start as usual from the bilinear form 

F = %Ai^u J9 

the invariance of which determines the law of transformation 
of the A'&; the f s are the contravariant components of an 
arbitrary vector I*, and the ufs are the covariant components 
of another arbitrary vector u. The procedure will consist in 
calculating, in two different ways, the quantity AF correspond- 
ing to a cyclic displacement round an elementary parallelo- 
gram (cf. § 2), and in equating the two expressions so 
obtained. 

A first way of calculating AF is as follows. We associate the 
increments 8, S' with two sides of the parallelogram, as in § 2, 
and use (1). Note first that from the definition of the covariant 
derivative (cf. Chapter VI, p. 146) SF is given by 

SF = J^Afatfu^ 

Similarly, applying the symbol 8' to this expression, we 
shall get 

From this, interchanging 8 and 8', we get 
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subtracting these two equations, after interchanging the indices 
h and k in the second, we get 

AF = i ljhk {A{ m - A{ m ) t uj 8x h S'x k . . (18) 

The other method of calculating this quantity consists in 
applying the operator A directly to the expression for F. Remem- 
bering the fundamental properties of A (§ 2) we shall get 

AF = £ IJ A{(u J A$ i + $ l Au } ), 
i ' 

or, substituting for A£' and Au } the expressions given by (i) 
and (4'), 

AF - = A{ { li, hk} Uj ? S*„ 8'x k 

- Z ijjlhk A[ (pj, hk) u* £' 8x h 8'x k . 

1 

In order to get the second sum in the same form as the first, 
we shall express it in terms of symbols of the second kind and 
of the covariant components of u; to do this we first use the 
property of antisymmetry with respect to the first pair of indices, 
and express the w ;,, s in terms of the u/s, so that the sum becomes 

+ Km, 4 (jp, hk) af*i' u t 8x h 8'x k . 

1 

Summing w r ith respect to p, and using (5'), we get 
i 

Now, in order to reconstruct the second expression for AF 
in a suitable form, we shall first interchange some indices, so 
as to be able to take out the factor £ 1 Uj 8x h 8'x /f which occurs 
in (18). We must interchange I aud i in the first sum, and / and 
j in the (modified) second; we shall then get, collecting both sums 
under a single summation sign, 

AF --• {il, hk) - A\ M}j f«>,8^ 

(DC55) " 7* 
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Comparing this expression with (18), and remembering that 
5, u, the 8x's, and the S'x's are arbitrary, we get the commutation 
formula 

Am ~ 4m = -S. **\ ~ **}]• ( 19 > 

If the system from which we start has m indices of covariance 
and jlc of contravariance, we must consider m vectors \> deter- 
mined by their contra variant components, and /x vectors u, deter- 
mined by their covariant components; by an analogous process 
we shall find 

(20) 



7. Cyclic displacement round any infinitesimal circuit. 

We now return to the order of ideas interrupted at § 2. Given 
a direction u at a point P, we shall give u a parallel displacement 
(in V n ) round a closed curve T, infinitesimal, but of any shape 
whatever, and of course passing through P; we propose to 
calculate the change Du? in a generic parameter of u caused 
by the cyclic displacement. The formula we shall find will be 
merely a generalization of (4), and must reduce to (4) 
if as a particular case we take for T an infinitesimal parallelo- 
gram. 

For an elementary displacement dx h we know that the change 
in u r is 

n 

du r — S t7t {ih, r}u i dx h9 
i 

so that it has the form of a Pfaffian 

u 

$ r = X h X rh dx h , .... (21) 
i 

in which the -X*s are functions of position (since they contain 
Christoffers symbols) and of the w*'s which are defined along 
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the curve T by the equations of parallelism du r = *ft r . We have 
to calculate the integral 

DvT = / 0 r = / i h X rh dx h , . . (22) 

where we use the operator D to indicate the increment resulting 
from displacement right round the circuit. We shall now con- 
sider any surface o- containing the curve T, and we shall call 
r the region of this surface which is within T, and is such that 
T constitutes its complete boundary. We propose to transform 
the integral round the circuit T, which occurs in (22), into an 
integral over the surface I\ To do this, we shall first introduce 
a system of co-ordinates q 1 and q 2 on the surface in question, 
defining them by the parametric equations 

^ ? 2 ) (* = 1> 2, . . . n). 

The dx's will consequently be linear functions of dq t and 
dq 2 \ substituting their expressions in the Pfaffian (21), this will 
take the form 

du r = t/r r - Q 1 dq 1 + Q 2 dq 2 , . . (21') 

where the quantities Q x and Q 2 , like the X's, are defined along 
the curve T. The integral to be calculated will thus be 

Du r = [ (Q 1 dq 1 + Q 2 dq 2 ). . . (22') 

T 

We shall suppose the curvilinear coordinates q l9 q 2 so chosen 
that the sense of integration round T is the same as that deter- 
mined on r (at a generic point) by the rotation (through the 
angle less than 180°) from the positive direction of the line q x 
(i.e. in the sense of q x increasing) to the positive direction of 
the line q 2 . 

The transformation of the line integral (22') into a surface 
integral taken over T could be effected at once if the Q's were 
defined as functions of position in the interior of T as well as on 
its boundary. But instead of this they contain the u r9 a, which 
are given at P, and at points on T have values resulting from 
the parallel displacement along T itself, but are not defined for 
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a point M within I\ their values at M depending on the path 
followed in the parallel displacement of u from P to M. We 
shall, however, show that if Y is infinitesimal the influence of the 
path of displacement on the values of the u r 'a at M is negligible, 
and therefore we may consider the vt 's, and in consequence the 
X 9 b or the Q's, as functions of position over the whole area F, 
which will enable us to make the required transformation of 
(22').' 

We shall make some preliminary remarks on orders of magni- 
tude, using for this purpose the general existence theorem for 
integrals of systems of ordinary differential equations. Such a 
system is constituted by the equations 

du r — ifj r (r — 1, 2, . . n), 

which define the functions u r , along a generic line T 7 , start- 
ing from a given set of initial values at P (cf. Chapter II, 
p. 23). 

Now the existence theorem assures us that in general (i.e. 
when certain not very restrictive conditions of continuity 
and differentiability are satisfied) the initial values define the 
integrals uniquely, and these integrals and their derivatives 
are continuous functions for values of the independent variable 
within an interval which is not shorter than some assignable 
quantity. 

In our case, granting, what will naturally be the case, that 
the coefficients a lk of ds 2 and the reciprocals a lk are finite and 
continuous, as well as their first and second derivatives, in a 
certain region round P, and supposing also that the length of 
the vector u at P is limited, i.e. is not greater than some specified, 
but arbitrary, constant U, it can easily be deduced from the 
above-inentioned existence theorem that — considering the arc 
of the curve of displacement as the independent variable— the 
u r 'a are defined (as continuous, differentiable, &c, functions), with 
P as starting-point, along any curve whatever, for a segment 
of the curve of length not greater than a certain quantity A 

1 We shall in fact here limit the discussion to an indication of the general lines 
of the argument, without pausing over the details needed to justify the. various 
Btepa of the proof with complete rigour. There is an exhaustive proof in the 
article by H. Tiktzk: "TJeber Parallelversehiebung in Riemann'schen Ratimen", 
in Math. Zeitsckrift, Vol. 16, 1923, pp. 308-317. 
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which depends exclusively on the metric of the manifold and on 
U. Hence it follows in particular that the differences between 
the u r9 s and their initial values are of the same order 1 of magni- 
tude as the length L of the arc along which the integration of 
the system du r --= </r, is effected. 

Further, the area P which we are considering on the surface 
a is infinitesimal, in the sense that we propose to make it diminish 
indefinitely. It is therefore perfectly legitimate to suppose that 
it is already so small that every point in it can be reached from 
P by a line of length not greater than A and that the length of 
the whole contour T is also less than A. 

We thus have the result that if u undergoes a parallel dis- 
placement from P to a point M within the area P, or on its boun- 
dary T, the u r 's at M differ from their values at P by quantities 
of the same order as L, if Z(<TA) is the maximum length of 
the lines considered. As a first approximation, in which quan- 
tities of the same order as L are neglected, we can therefore 
take the components a' as constant and — tT (0) over the whole 
area T, including its boundary. 

We can find a closer approximation if we calculate the u r9 s 
at HI by integrating the Pfaflian (21) along a curve PQ, sub- 
stituting, however, for the coefficients X, h their values at P. 
This process involves an error of order L in the values of these 
coefficients, and therefore an error of order L 2 in the values 
found for the u' \s at M; the choice of the curve PM is indifferent, 
as in this case the Pfaflian becomes an exact differential. As 
the coefficients are constants, the integration can be at once 
effected, and will give linear functions of the x\s for the vl 's. 
We shall thus have obtained the u r9 & as functions of position at 
all points (including the boundary) of the area P, neglecting 
quantities of order 

We can get a third approximation by substituting these 
approximate linear expressions of the u 1 's in the coefficients 
X, h . The Jf's will thus become functions of position, as was 
required, defined throughout T, including its boundary T, and 
coinciding on T (if we neglect L 2 ) with their accurate values 
as already defined. These values of the JSC's can be used to 

1 This means that the differences in question are not greater than the product 
of h by a certain finite coefficient, which dociH not depend on L or on the curve of 
integration, but only on P, If, and the metric of the manifold. 
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calculate the integral (22), which will give the value of Du r , 
the error being now of order ZA It was necessary to carry 
the approximation thus far, since the other two approximations 
make tfj an exact differential, and would give the value zero for 
Du r 9 the obvious meaning of this being that Du r is a quantity 
of order L 2 . 

We shall therefore return to (22), giving the X 9 s the meaning 
just explained, and transform it into the form (22') by using the 
parametric equations of the surface a\ Q x and Q 2 will now 
represent functions of position defined at all points of I\ In 
consequence 1 we can transform the line integral into a surface 
integral, getting 



We must now find the value of the integrand on the right of 
(23), which can be done by means of the following considera- 
tions, without writing out the expressions for Q x and Q 2 at full 
length. 

Let the operator 8 denote the increment of a quantity corre- 
sponding to a displacement along the line q 2 = constant, when 

1 By the ordinary formulae, 



where / is a function of q x and which is continuous, together with its first 
derivatives. Usually in these formula; (j { and r/ 2 are interpreted as Cartesian 
co-ordinates in a plane, and the senso in which the curve T is described is defined 
by the condition that the pair of directions *, n (a the tangent to the boundary in 
the sense in which it is described, n the normal to T drawn inwards) is congruent 
in the plane with the pair 71, 72. The formula* obviously hold, however, indepen- 
dently of this interpretation in plane geometry, and can therefore be applied even 
if 7it 7a are ft,| y curvilinear co-ordinates whatever, the sense of describing the curve 
being determined by an analogous criterion to that just explained, provided that 
we introduce, at a generic point of the Ixmndary curve, the directions tangential 
to 71 (i.e. 72 = constant) and 7* in the sense in which the respective parameters 
increase. Now we have already supposed (p. 187) that the auxiliary curvilinear 
co-ordinates 71, 7.. behave as regards sense in precisely this way. Hence the 
equations {O) hold both in magnitude and in sign. 




or also 
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q t increases by dq^, similarly, S' corresponds to an increase dq a 
in j 2 alone. We shall therefore have 



(24) 



and analogous expressions for any function of position. Now 
note that in (21') the first term represents precisely the incre- 
ment of u r due to a displacement of the first type (dq 2 — 0), 
so that 

Su r = Q x dq ly 

and, the second term having an analogous meaning, 

S V = Q 2 dq 2 . 

We can therefore write (23) in the form 

= f r [S8'« r — S'Sw r ]; 
and from this, remembering equation (4), we get 

/" 
r i 

Now let us put f , 7) for the parameters (in V n ) of the lines 
q 2 constant, q x — constant, i.e. 

& _ S^i i _ 8 ' x % 

where Ss, 8's are the lengths of the displacements, along the 
lines q 2 — constant, q x = constant, whose components are Sx h 
8'Xi. Then we can also write 



Du 



r — / r fr S« 8's (25) 
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where for shortness we have put 



We can now remark that if & is the angle between the co- 
ordinate lines q v q 2 , the element of area (cf. Chapter V, p. 99) is 

dF =: 8s 8's sin&, 
and therefore (25) becomes 

Du r - ( f' dr. 

J r sunt 

By the mean value theorem, if \-~^ 1 denotes the value 

- Lsm&.U 0 

of the function of position at a suitable point M 0 (not known 

sin* 

a priori) within F, and DF the area of the region T, we can also 
write 

Du r - fA I DF. 



Lsini> 

f 

Now the value at M 0 of the function of position Jr differs 

sm<9- 

from its value at P by quantities of order L (since the distance 
M 0 P is of this order); since the area DF is of order L 2 , the error 
caused by substituting the value at /-* lor that at M is of order 
L 3 , which we have agreed to neglect. We shall therefore have 

Du r = -f- DF 9 
sm& 

d r n 

or Du r - , ~ Z lhb { ir, hk) u f £ " v *. . . (26) 

sin* i 

In this formula the quantities £ ft and rf represent the para- 
meters of the lines q x , q 2 at P, and i> is the angle between these 
lines; the values of the w"s and of the Riemann's symbols refer 
to the point P. It will be seen that the influence of the circuit 
of displacement appears in this formula in three geometrical 
elements, which serve substantially to determine the circuit 
itself, namely: two directions ^, rj (a priori any whatever) which 
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determine the section on which the circuit is supposed drawn, 
together with the angle between them; and the area DT of the 
circuit itself (measured according to the metric of V u ). 

8. Peres's formula. 

From (26) we immediately get the fundamental formula which 
serves as a link between parallelism and curvature. 

Take any (fourth) direction v drawn from P. Let a be the 
angle between v and u, and a + Da the angle between v and 
u + Dn; we propose to calculate Da. To do this we take the 
scalar product 

u X v = cosa 

(assuming that u, like v, is a unit vector), and differentiate with 
the symbol D, remembering that Dv --- 0 since v is a fixed 
vector. We get 

v X Du — sina Da, 
or, substituting for the scalar product on the left its expression 

n 

S r v r Du r , and using (26), 
1 

DY n 

sina/)a = — . - E M {ir, hk} u { v r f h rf. 

If in this formula we express the w/s i n terms of the t^'s, and 
sum with respect to r (remembering (5) ), we get 

sina Da = - " Z tjhk hk) u* v> ? if. 

Now if the directions u, v coincide or are opposite, this formula 
reduces to an identity of no interest, since on the left we have 
sina ~~ 0, and the right-hand side vanishes from the antisym- 
metry of the Ricmann's symbols. Excluding this case, we can 
divide the whole equation by sina, and we get PMs's formula 
nr " 

Da-^ .---S^^ttJfiVfV- • (27) 

sina sin9< 1 

9. Application to surfaces. Gaussian curvature of a F 2 . 

In considering the particular case of a V 2i i.e. an ordinary 
surface, the directions u, v must of course be contained in the 
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section (the only one there is) defined by vj. Since the only 
purpose of these last two vectors is to specify the section on which 
the circuit is drawn, we can make them coincide with u and v 
without loss of generality; (27) will then become 

Da = S UUc (ij,Kk)u t v'i*f. . . (27') 

sin^a i 

Since for n ~ 2 Riemann's symbols which do not vanish 
are represented by the single arrangement of indices (12, 12), 
this formula can be further reduced to 

Da = , ^ L - (12, 12) (utv 2 — u^v 1 ) 2 ; 
sm 2 a 

or finally, remembering the expression for sina in terms of the 
parameters of the two directions u, v (cf. Chap. V, p. 94, formula 
(9') ), we get 

DV 

Da = — (12, 12). 
a 

It is usual to put 

<- 12 ' K, (28) 

a 



so that the foregoing formula becomes 

Da 



DT =- K - (29 > 



From this it will be seen that the function of position K 
defined by (28) is an invariant; it depends on the coefficients 
a lJt and their first and second derivatives, and is identical with 
the quantity which in the theory of surfaces is known as the total, 
or Gaussian, curvature (the product of the curvatures of the prin- 
cipal sections). 1 The equation (29) can be put in a more instructive 
form if we introduce the angle of parallelism e, i.e. the angle 
between u and u + Z>u (or between u and its parallel obtained 
by displacement round the circuit), measured in the sense in 
which the circuit is described. We can also say that € is the angle 
through which u has been rotated as a result of the cyclic dis- 

1 See also below, Chapter IX, p. 261. 
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placement. It is obvious that € has the same absolute value as 
Da, but we shall see that as regards sign the precise relation is 

c = —Da. 

To show this we need only remember the convention adopted 
above (§7) that the circuit 
is to be described in the 
positive sense with respect 
to the co-ordinate lines q l9 
q 2 , or from the versor £ 
to the versor tj. As § an( i 
now coincide with u and v 
the sense in which the cir- 
cuit is described is from u 
to v (through the convex 
angle). Accordingly (see 
fig. 3) if Da > 0, u + Du 
is outside the convex angle 
uv, and is therefore reached 
from u by moving in the 
negative sense (e <c 0), and 
write (29) in the form 

€ 

Dr 




Fig. 3 



vice versa. We can therefore 



= K, 



(29') 



which gives the following important interpretation of the curva- 
ture K : K is the ratio of the angle of parallelism (taken with its 
proper sign in relation to the sense of describing the circuit) 
to the area of t)ie circuit. 

In the case of a Euclidean V 2 Tliemann's symbol is zero 
(of. § 4) and therefore K — 0. This can also be deduced from 
the geometrically obvious fact (already used in § 2) that the 
parallel displacement is integrablc (i.e. that, the result does not 
depend on the curve of displacement). 

10. Riemannian curvature of a V n . 

If instead of a surface we consider any V n whatever, the 
notion of curvature becomes less simple. If P is a fixed point 
of the V n , then with every section through P determined by two 
arbitrary directions %, tq drawn from P we can associate an 
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invariant K, which is called the Rwmannicm curvature of the 
V n at P with respect to the section considered. Following Rie- 
mann, we construct the geodesic surface determined by the 
point P and the two directions yj, and then take the Gaussian 
curvature K of this geodesic surface as the curvature of the 
F n at the point and in the section in question. In general the 
Riernannian curvature differs in the different sections. 

The foregoing considerations enable us to give another impor- 
tant definition of the Riernannian curvature, and to find the 
analytical expression for it. 

Given the elements P, yj, construct the geodesic surface 
g defined by them and consider an infinitesimal circuit on g, 
passing through P, of area DF. Take one of the given directions, 
Say an d give it a parallel displacement with respect to the 
surface g round the circuit, in the sense % — yj. Now calculate 
by P^rfes's formula the change Da in the angle between ^ and yj, 
i.e. the difference between its values before and after the dis- 
placement. The curvature K will then be given by (29). Now 
from Seven's theorem that an infinitesimal parallel displacement 
with respect to the surface </ (using the metric of g) can be replaced 
by the analogous infinitesimal parallel displacement in V n , it 
follows at once that this method of calculating K does not really 
involve the use of the geodesic surface g. Hence the Riernannian 
curvature K can be defined as the ratio (with sign cJianged) of 
Da to DT, where Da is the change in the angle between the given 
directions 5* yj caused by the parallel displacement in V n of one of 
these directions round an infinitesimal circuit of area DP, belonging 
to the section *l, and described in the sense ^ — - tq. We therefore 
have 



as in the V 2 . 

The explicit expression for K corresponding to this can be 
obtained from ('27'), and we get 



K 



Da 



(30) 




(31) 



The symmetry of the right-hand side in u and v provides 
formal confirmation of the fact that it is immaterial whether 
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we displace u or v, as we get in either case the same value for 
Da and the same value for K. 

A third definition of the Riemannian curvature can be 
obtained from the proof of the following lemma. 

Take any three points P, P', P" on a Burface F 2 , and join 
them in pairs by arcs of geodesies, forming a so-called geodesic 
triangle. If </>, <j> <f>" are the angles of this triangle, the quantity 

€ =■ </> + f + tf>" - 77 . . . . (32) 

is called the geodesic excess. 

Now take any point on the triangle, and give a parallel dis- 
placement round the triangle to the direction of the side at 
that point, or of one of the sides if the point is a vertex; e.g. 
starting from P, the direction u at P of the side PP' (taken in 
the sense P — P'). 

We want to show that the angle between the initial and 
final positions of u (measured from the initial position in the 
sense which at each vertex is dependent on the sfcnse PP'P" 
in which the circuit is described), i.e. the angle of 'parallelism 
(relative in this case to a circuit of a special kind, but without 
the restriction of being infinitesimal), is the same as the geodesic 
excess e. 

For the proof we shall follow u in a cyclic displacement 
round the circuit PP'P", noting in the first place that from 
P to P' u remains tangential to the side PP', on account of the 
autoparallelism of geodesies (cf. Chapter V, p. 101). On arriving 
at P', u is tlms inclined at an angle tt — <f>' (outside the triangle 
at P') to the side P'P" (in the sense indicated by these letters); 
more precisely, u is behind the tangent to the side P'P" by an 
angle of tt ~ <f>\ the sense of rotation at P', as we have already 
said, being determined by the sense of description PP'P". In 
the displacement from P' to P" this angle remains unchanged; 
at P" there will be a further loss of tt — <f>" (with respect to 
the new side P"P)\ and finally at P yet another loss of tt — <j> 
(with respect to PP'). Taking all these together, we see that 
in its final position the parallel to u has been rotated away from 
its initial position through an angle of 37r — (<f> + <f>' + <f>") 
in the negative direction, or e — - 2tt in the positive direction. 
Now in the pencil of directions at a point an angle is determined 
geometrically by a quantity of the form G + 2tt7r, where n is any 
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integer; hence we have proved that one value of the angle of 
parallelism is the geodesic excess e. Now in a Euclidean space 
the geodesic excess is zero, and in any manifold whatever, for 
an infinitesimal triangle, the excess is infinitesimal; we therefore 
see that for reasons of continuity the value adopted for the 
angle of parallelism is in fact the most suitable, being that which 
tends to zero with the triangle. 

The lemma just proved is rigorously true in a V 2 , whatever 
may be the magnitude of the geodesic triangle considered. If 
in particular we apply it to an infinitesimal triangle, we can 
substitute c for — Da in (29), so getting 




a formula which defines the Riemannian curvature as the ratio 
of the geodesic excess to the area of an infinitesimal geodesic triangle 
lying in the section considered, and having one vertex at the given 
point P. It will be seen that this is an obvious extension to n- 
dimensional manifolds with any metric of an elementary theorem 
in spherical geometry (the area of a spherical triangle = the 
spherical excess X the square of the radius); the latter 
theorem, however, unlike the former, holds also for a finite 
triangle. 

We must confine ourselves to a mere reference to the 
important researches of Professors Schouten 1 and Bompiani 2 
on the simultaneous cyclic displacement of several directions, 
and even of the whole sheaf of directions drawn from a single 
point. Their work throws light on the theory of Riemannian 
curvature under various new aspects. 

11. Case of a F 3 . The tensors a n , of Ricci and G lk of Einstein. 

For a manifold of three dimensions the symbols of the first 
species (ij, hk) which do not vanish reduce substantially, in virtue 
of (10), (11), and (14), to the scheme 

(i + 1 i + 2, k + 1 k + 2) (i, k - 1, 2, 3), 

1 See references given in note (1), p. 172. 

*" Studi augli spazi eurvi'\ in Atti del R. 1st. Veneto. Vol. LXXX, 1921, pp. 355- 
386, 839-859, 1H3-1140. 
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with the convention (p. 161, § 9) of regarding as equivalent 
two indices which differ by a multiple of 3. 

Using this notation, we now introduce after Ricci the double 
symmetric system 

a . = << + 1 i + 2, h+ 1 k + 2) . k = 2j 3)> (33) 
a 

which constitutes, as we shall now show, a contravariant tensor. 
In fact, if wc make use of the contravariant system € (Chapter 
VI, p. 158) we see that (33) is equivalent to 

o- 1 " = «*" (W, r*) (t, i = 1, 2, 3): . (33') 

1 

which proves the assertion. The verification of this last formula 
is immediate, when we remember that of the various determina- 
tions a 'priori possible for the pair p, q 9 there are only two, 

p ^~ i 1, q = i + 2 

and p — i + 2, 9 = i + 1> 

corresponding to which e <ptI has a value different from zero, 

viz. in the former case, — -~ in the latter. Similarly, for 

the other pair r, s, the only determinations to which there corre- 
sponds a non-vanishing e fr '" A arc 

r - k + 1, 5 /c + 2 
and r =- & + 2, 5 = k + 1. 

The sum ^ pqrH reduces therefore to four terms all expressible as 

(i + 1 i + 2, k + 1 k + 2) 

-- - 9 

a 

thus giving the result stated in (33). 

The name Ricci 9 s symbols is sometimes given to the a l7r> s 
just defined, or, more specifically, to the elements 

a ih = i }h a fj a hk a jh (i, k = 1, 2, 3) . (34) 
1 

of rfZ?e reciprocal covariant tensor. 
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It is worth noting that equations (33), if we take account of 
the properties of Rieniann's symbols in respect of symmetry 
and antisymmetry, can be solved so as to give the values of these 
symbols in the form 

(V,U) = i vp a^€ viJ € phk . . . . (33") 

In fact, expanding the second member and attending to the 
definition of the covariant system c, we find the value zero corre- 
sponding to every set of four i, j, h, 7c for which the first member 
vanishes, and the value aa ttc corresponding to a set of four of the 
type i + 1, i + 2, k + 1, h + 2; (33") therefore follows from (33). 

By contracting the Ricci tensor a by means of the funda- 
mental tensor (the coefficients of ds 2 , or their reciprocals) we 
obtain the linear invariant of the tensor a 

M - i Uc a ik a* ^ k a' l a a . . . . (35) 
i l 

We may point out another formal relation of which xiae will be 
made in Chapter XII. For any V n whatever we can derive from 
the Riemannian tensor, by contraction with respect to two in- 
dices, the covariant double tensor 

Q ik = % h a> h {ij,hlc) (36) 

i 

which, in virtue of equations (5), can also be written in the 
form n 

= S A [«,**} (36') 

This tensor was noticed by Ricci, who applied it to the study of 
the local distribution of curvatures in a V n ; it was afterwards 
taken up by Einstein, who gave it a fundamental place in the 
theory of relativity (in which n — 4): it is commonly known 
as the Einstein tensor. 

For n — 3 the a ( ,/s are related in a simple way to the firm's. 
To bring out the connexion simply and neatly, it is convenient 
to make use of two properties of the ternary systems e: one 
expressed by the identity (which can be verified immediately) 

Zj*"** - - S;'S? (p, q 9 r, s - 1, 2, 3), . (37) 
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the 8's having the usual meaning (0 for different indices, 1 for 
equal indices); and the other translating, as it were, the definition 
of a jh as a reciprocal element 

Substitute in equations (36), in place of the a ih9 s, the second 
member here; and, in place of the (y, /*7c)'s, the expression for 
them in {33"), with € phk changed into — € pl . h . Taking account of 
(37) we find 

a* - 1 k m <v <v ^ («r 8 i - to 8 p - s ; s *>- 

Of the four terms obtained by expanding the product of the 
bracketed expressions, the two which are positive are merely 
interchanged by interchange of p with q and of r with s 9 which 
does not alter the product a pr a (JR ; and similarly with the negative 
terms. We can therefore confine our attention to one term only 
of each kind, and suppress the factor J. If we take, e.g. 

8*87 (S! 8* - 8' 81), 

and bear in mind the meaning of the symbols 8, we find that the 
sum reduces to one with respect to v and p only, giving 

3 

G Uc =■= V (a^a^ar—a^a****), 

or finally, having regard to (34), (35), 

G,t = -Jlf*ut (*> * = 1, 2, 3). . (38) 

12. Curvature of a manifold of three dimensions around a 
point. Principal directions and invariants. 

Consider in a F 3 a generic section or facet, f 9 defined by two 
directions (versors) u, v, whose parameters are u i 9 v l (i — 1, 2, 3), 
issuing from the same point P. Let w be their vector product 
(Chapter VI, p. 159), the moments of which are 

w v - EflC^wV (v 1, 2, 3). . . (39) 
Corresponding to the section f in our manifold V$ we have the 
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Riemannian curvature K given by (31), with n — 3. In the 

sum 2^ of (31) it is convenient to introduce, in place of Rie- 

i 

mann's symbols (ij, hk) the expressions for them given in (33"). 
Taking account of (39), we find immediately 

K = -1 ia^w,w p . . . . (40) 
sira i 

Deferring for a moment the illustration of this formula, we recall 
the fact that in general the length w of the vector u A v (the 
vector product of u and v) is given by the product of the lengths 
of u, v by the sine of the angle between them. For the Euclidean 
V 3 this implies, as already mentioned (Chapter VI, p. 159), the 
identity of the moments (39) referred to Cartesian co-ordinates with 
the ordinary components (orthogonal projections) of the vector 
product u A V. For a general F 3 , it is sufficient to remember 
that we can choose co-ordinates which are locally Cartesian at 
any assigned point P, so that the a ik '$> have the values Sf (and 
so that also — though this is not important for the present pur- 
pose — their first derivatives vanish). Now both in the measures 
of the lengths and the angular distances of vectors proceeding 
from the point P, and in the definition (39) of the w„'s, there 
enter only the components of the vectors and the values of the 
a ik & at P. Locally then everything is the same as for Euclidean 
space. 1 

Turn now to our case, in which u and v are unit vectors. It 

follows that w = sina, whence — = !~ constitute the 

w sma 

1 We can also of course calculate the length of w. Thus we note that, )>esideB 
(89), wc have the equivalent formula* defining the contravariant components: 

Hence, in view of (37) (the index of summation in which we may suppose 
transferred to the first place), 

8 8 3 8 

= St w» S ; vJ vj - E* u* vi Si v$ ut 
l 11 l 

a- M 2 t fi _ ( uv cosa) 2 = u 3 v a sin 2 a. 
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moments A; of the direction X normal to the section /. The sense 
assigned to the normal by (39) is characterized by the system 
e or, geometrically, by the sense of the trihedron formed at P 
by the positive directions of the co-ordinate lines. In fact, from 
these equations (39), if we suppose, for example, that the lines 
I, 2 are taken in the directions u, v, there results 

sj a ~ 

Wl = W 2 =0, W 3 = — r --= > 0, 

v a n 0,22 

so that (Chapter V, p. 127) w makes an acute angle with the 
positive direction of the line 3, whose parameters are 

s\= s*= 0, 4 = * >0. 

va 33 

Thus X is perpendicular to / at P, and directed so that the 
trihedron u, v, X has the same sense as the trihedron formed 
by the positive directions of the co-ordinate lines at the same 
point P. 

Equation (40) now takes the form, given by Ricci, 

K = S^a^A, = E^A'A', . . (40') 

which defines the curvature of a variable section f through P as a 
Jiomogeneous quadric in the parameters A'' (or in the m,oments X v ) 
of the normal to f , the sense of the normal being indifferent, since 
the expressions in (40') do not change when we change the signs 
of the A's. 

The dependence of K on the direction X of the section with 
centre at P is of purely local nature. We can therefore, in accord- 
ance with an observation made above, make use of the elemen- 
tary criteria of analytical geometry just as if it were a question 
of ordinary space — we have only to take co-ordinates which are 
Cartesian and orthogonal at P. The A"'s then become direction 
cosines, and we have for K (except for a different signification 
of the coefficients a ) the same expression as the one which 
characterizes the distribution of moments of inertia (of an assigned 
material system) with respect to the oo 2 axes coinciding with the 
lines of the versors X proceeding from P. As we know, the law 
of variation of K becomes expressible geometrically if we intro- 
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duce the ellipsoid of inertia E, of centre P: to any direction of 

X there belongs a value of K> viz. whero Q is the intersection 

PQ 2 

of the line of X with the ellipsoid E; the three axes of E corre- 
spond to stationary values of K (with respect to neighbouring 
directions), the axes of maximum and minimum length in parti- 
cular corresponding respectively to minimum and maximum 
values of K. 

That being so, the same law of variation will hold for K the 
curvature; but in the general V% interpretation is necessary. 
This implies merely that the ellipsoid E (like, for example, 
Dupin's indicatrix for the case of an ordinary surface) takes us 
in thought outside the F 3 , as an auxiliary representative element 
to be associated with the Euclidean three-dimensional space 
which is tangential to the F 3 at P when, as is always allowable 
(Chapter V, p. 121), we imagine the F 3 immersed in a Euclidean 
S N (N > 6). 

The outstanding result is that there exist at evert/ point P at 
least three mutually orthogonal directions X 1? X 2 , X 3 to which (or 
rather to the normal sections perpendicular to which) belong 
eurmtures tvhich are stationary with respect to those of adjacent 
sections. These directions are called principal directions of curva- 
ture, and the corresponding values co,, o> 2 , a> 3 of K principal 
curvatures. In general, that is when the three a/s are distinct, 
the principal directions are uniquely determinate (ellipsoid with 
three unequal axes); when two principal curvatures are equal 
but differ from the third (ellipsoid of revolution), e.g. co 2 = 
a> 2 4= o> 3 , only the principal direction X3 is uniquely determinate, 
while every pair of directions \ y X 2 orthogonal to X 3 and to each 
other can be considered principal. If the three principal curva- 
tures coincide (sphere) the curvature K is the same tor all sections, 
and every set of three mutually orthogonal directions is a prin- 
cipal set. 

All this of course can be established by purely algebraic 
methods: we have only to avail ourselves of the theory of quad- 
ratic forms and their transformations. Let 

i^iya^X, ^ = S y a ty A l A^ . . (41) 
be two quadrics, one of which at least is definite, say <f>; the inde- 
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pendent variables being A*' (i = 1* 2, . . . n). Then the following 
facts are well-known 1 : 

(1) If we consider the ratio 

co — - — 

as a function of the A's, and look for the values of these variables 
which make 8a> = 0, we are led to the system of n linear homo- 
geneous equations 

£, (a (J - toaj \ j = 0 (t = 1, 2, . . . n); . (42) 

these are satisfied by values not all null of the A's if and only 
if the determinant of the coefficients vanishes, so that the oi's 
are roots of the equation of degree n: 

\\a l} ~ok* v || = 0, .... (43) 

called the characteristic equation. 

(2) If A* 9 A' 4 are two solutions of equations (42) corresponding 
to two distinct roots co, to' of the characteristic equation, there 
exists between them the relation (of orthogonality) 

E fJ a tJ A' A'' = 0. 
i 

(3) The characteristic equation (43) has its n roots co ly co 29 . . . co n 
all real (distinct or coincident). 

(4) To each simple root co h of (43) corresponds, in the mani- » 
fold V n the ds 2 of which has the a^B as its coefficients, one and 
only one direction X /t whose parameters A£ satisfy (42) for co — co h . 
With each root of multiplicity fx (> 1) we can in an infinite 
number of ways associate /x mutually orthogonal directions in 
V n , whose parameters are independent solutions of equations 
(42) when we give to the value of the said multiple root. 

From all this there results that it is possible in every case 
to set up at least one set of n mutually orthogonal directions 
Xfc (h — 1, 2, ... n), (uniquely determined in the general case, 

1 Fur proofs, see for example: Bianotu, hezioni di geometria analilica, 
Appendix; (Pisa: Spoerri, 191 5): or Jlitomvim, Quadratic Forms and their 
Classification , . . (Cambridge tr&ctn on Mathematics ... No. 3). 
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in which the roots of (43) are all simple), the parameters X j h of 
which satisfy the system (42), with o» equal to the corresponding 
<*>k- 

By means of these X A 's we can obtain the so-called canonical 
expressions. We begin by introducing the moments 

K\i = % a tJ K (ft, i - 1, 2, . . . n), . (44) 
i 

and observe that, by associating with the n identities 

\\{X hU = 1 (h - 1,2, ...n) 
i 

the \n(n — 1) conditions of orthogonality of two different direc- 
tions X A , X^ of the n-ple set 

%KK U o (h*k), 

we obtain altogether the n 2 relations 

2,AftA n; = 8^ (A, & ],2,...n), 

which express the noteworthy fact that the n 2 parameters A{ 
q/* an n-p/e orthogonal set are the elements reciprocal (in an algebraic 
sense) to the n 2 moments in the determinant \ \ X h ^ || which they form; 
and vice versa that the moments are the reciprocals of the parameters 
in the corresponding determinant || A{ || (cf. Chapter IV, p. 74). 

Further, besides the relations just written which refer to the 
columns, their analogues with respect to the rows also hold good; 
these may be written 

S A AiA A|£ - h{ 1,2, ...n). . (45) 

Taking account of this, if we multiply (44) by A /t(fc and sum 
with respect to A, there results immediately 

a ik = A, t|i A /t|fc (i, k ^ 1,2,.. n), . (46) 
i 

which are expressions for the fundamental tensor a^ in terms of 
the moments of any orthogonal n-ple set. 
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Consider in particular the n-ple set (or one of the n-ple sets) 
A /4 , the parameters of which, \{ 9 satisfy (42) where, for each index 
h, w = co h . These equations, in virtue of (44), may be written 

n 

^j a u^h = <*>hK\i (h i = 1, 2, . . . n). 
1 

Multiplying by A A)A ., and summing with respect to the index Ji, 
while attending to (45), we obtain the canonical expression for 
the a^'s to be associated with (46). viz. 

a lk = S /t co /t (i, h = 1, 2, . . . n). . (47) 

After this the simultaneous reduction of the two quadrics to 
orthogonal form is easily effected by substituting for the original 
variables A 1 the n linear combinations 

z h - SAjiA* (h - 1,2, . . .n). . (48) 
1 

In fact, when we substitute in (41) the values (46), (47) of the 
coefficients, taking account of (48), there results 

n >» 

<f> = ^z,, 2 , ^ - JL h o> h z£. . . . (49) 
i 1 

The condition that the A' 's should be parameters, that is to say 
that the expression (41) for <j> should have the value 1, becomes 

n 

in the new variables z, 2 A z,, 2 = 1. The mode of variation of ^r, 

1 

when the A's are parameters of direction, is identical with that 
of the ratio -, when the variables are independent; or, if we 

n 

wish, of the quadric *p H h co h z k 2 , when the z's are connected 

11 1 
the relation E 7 , z h 2 = 1. 

1 

Moreover, the stationary values of if* in these cases are pre- 
cisely the roots of the characteristic equation (43). 
The form 

0 = iyayXX 
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is an obvious generalization, for any value of n, of the expression 
(40') for K, which defines the local distribution of curvature in 
a V 3 . 

The above considerations manifestly apply also to the 
behaviour of the curvature K when the direction of the section 
is supposed to vary, a topic which has already been discussed in 
a more elementary way. 



13. Geodesies infinitely near a given geodesic. 

We shall conclude this chapter with a discussion of the 
extension to n dimensions of a classical formula due to Jacobi, 
which defines in a very simple way the aggregate of those geo- 
desies g of a surface which are infinitely near a given geodesic 
&, called the geodesic base, Jacobi gives the linear equation 

where y denotes the distance (normal) of any point M of g from 
the base, a the arc of the base measured from an arbitrary origin 
O up to the projection P of M upon B, and K(a) the Gaussian 
curvature of the surface at P. 

(J) is simply, in Poincare's phrase, the eqwition of variations 
of geodesies starting from B. There can be deduced from it, as 
we know, 1 some very remarkable consequences with respect to 
the behaviour of geodesies in the immediate neighbourhood of 
the base, the nature of the surface intervening only through its 
total curvature K. This is obviously an intrinsic question, 
depending entirely on the metric of the surface (as defined by 
its ds 2 ), and not at all on the different configurations which the 
surface can present in space. 

It naturally suggests itself that we should try to extend the 
study of this subject of geodesic deviation to a Riemannian 
manifold V H of any number of dimensions. We have long had, 
of course, the equations of Lagrange defining the geodesies of a 
V u9 in a form the convenience of which, whether from the point 
of view of theory or of notation, is all that could be desired. 

1 See, for example, Darboux, TMoric dts surfaces, Vol. Ill (Paris, Gauthier- 
Villars, 1894; new impression 1923), Chap. V; or Blasghkk, Vorlesungcn liber 
JHfferentiatgeometrie, Vol. I (2nd edition, Berlin, Springer, 1924), §§ 83-88. 



GEODESIC DEVIATION 



209 



These equations we may therefore use for the purpose of forming 
the equations of variations. Then with the help of Bianchi's idea 
(Chapter V, p. 139) of the derivative of a vector attached to 
the points of a line of V n , we reduce these equations to a condensed 
form, geometrically suggestive and of course invariant. The actual 
construction of the equations (linear of the second order, n in 
all, with the same number of unknowns) requires no further 
data than the base B and the metric of the manifold (especially 
Riemann's symbols) along that curve. 

We find that this system of equations admits a linear first 
integral, which in its turn leads to a linear relation in finite terms 
among the unknowns. We are thus left with a system of n — 1 
equations or, in the special case of an ordinary surface, with a 
single equation, as in Jacobi's classical result. 

To bring the final system of equations to as simple a form 
as possible we have to make a suitable choice of variables. Now 
we have already seen (§ 11, p. 164) that it is possible, begin- 
ning with any co-ordinates x, to define new co-ordinates y in 
terms of which ds 2 becomes locally Cartesian, in the sense that 
the derivatives of the coefficients a ik all vanish at an assigned 
point 0. We have also seen (Chapter VI, p. 167, footnote) that it 
is possible, any curve B being given, to choose co-ordinates y 
for which ds 2 Ixas this locally Cartesian character at every point 
of B. When B is a geodesic, the system of co-ordinates y which 
(§17) will finally be used has the following properties, which we 
merely state here without proof. 1 Let M be any point in the 
immediate neighbourhood of B, P the orthogonal projection of 
M upon B. Then y tl is the arc of the base B measured from an 
arbitrary origin 0 up to P; the y a '& (a == 1, 2, ... n — 1) may 
be regarded as components of the elementary vector PM in 
n — 1 directions mutually perpendicular and all perpendicular 
to B, chosen arbitrarily at 0 and carried by parallelism 
along B. 

14. Geodesic deviation in an n-dimensional manifold. 

Consider, along with the geodesic B, any other geodesic g 
(more precisely, an arc of g) belonging to the immediate neigh- 

1 For a complete development of this point, compare the paper "Sur l'ecart 
geud&jique "in Mathcwwtiuche AnncUen (Vol. 97). 

( D 056 ) 8 
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bourhood of -B. 1 Corresponding to every point M of g, take the 
point P of B having the same y tl as M and the rest of its z/'s ;= 0. 
It is important for what follows to fix precisely the relation 
between an elementary arc ds of g and the corresponding arc 
da = <Zy n of 2?. Throughout the neighbourhood of JB wc have, 
for the coefficients b lk of ds 2 in terms of the co-ordinates y, the 
Euclidean values 

b ik - 0 (i 4= k), b u - 1, 

neglecting quantities of the second order. 
It follows that for anv curve whatever 



the quantity under the radical differing from its exact value only 
by terms of the second order. For y, both the y a and the 

may be regarded as infinitely small. It follows that 

ds ds ^ 

to a second approximation— a generalization of the elementary 
fact that a segment infinitely near (with respect to direction) 
a given right line differs from its projection on the line by an 
infinitesimal of the second order. 

15. Invariant form of the equations defining geodesic deviation. 

We have now to form the equations defining in general co- 
ordinates the behaviour of any geodesic g infinitely near B. 
Put 

*i - + .... (50) 

where the £"b and their derivatives with respect to a are infinitely 
small. 

The f*'s represent the increments of the co-ordinates x t of 
a point P of B which passes to a corresponding point M of g: 
they can be regarded as the contravariant components of the 

1 In the strict sense, that is to say, with the understanding that not only are 
changes of position of corresponding points on JS and g to be small, but also 
changes of direction of corresponding tangents. 
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elementary vector PM = %. Adopting a more general point of 
view, we shall now regard this vector as not necessarily perpen- 
dicular to B, its orientation depending on the law of corre- 
spondence between the points P and M of our two geodesies. 
We can therefore no longer assume that the elementary arc 
ds of g is (as in § 14) equal to the corresponding arc da of B\ 
but, the displacement in question being always infinitely small, 
we can foresee all the same that, if we put 

* « 1 + A, (51) 

da 

the elongation (or coefficient of dilatation) A remains infinitely 
small with the £"s and their derivatives. This will be proved 
formally in a moment. Meantime, differentiate the formulas (50) 
with respect to cr. We have 

= IT. - - • - (50') 

ocr da 

where x t is written for ~, and <j>i for 

ds da 

A second differentiation gives 

dH l 



+ d***> = (f, > + do*' 



From (51), d 2 s _ d\ 

and (50') can be written 



da 2 da 



1 i • _L # 

— <Pi — — A»; + 

da 

all this holds without any assumption concerning the smallness 
of A. 

dX 

Considering now A and as infinitesimal, and neglecting all 

"a d^s d/A d\ 

terms of the second order, we may replace x t = x { by <fi i9 

da 2 da da 

and consequently take the equation for x t to be 



• • (ao "> 
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Now (Chapter V, p, 134) for any geodesic g we have 

n 

$i = — Z/jk {jK i} Xj x h (i = 1, 2, . . . «). 
/ds\ 2 

Multiply these equations by ( —~ l , and make the following sub- 

\da/ 

stitutions: (50) in the {jk 9 i)> (50') in the x p (50") in the 

(%)'*> 

Remembering that 

A = — *}^^*» 
i 

since JS is a geodesic, and of course retaining terms of the first 
order only, we find 

Next, for the sake of showing explicitly the contravariance of 
the <f>i (parameters of the direction tangential to B), put 

<j> t (o*) -= (i 1, 2, . . . n). . . (52) 

The equations just obtained (if we also change the indices i and 
j into r and i) become 

= - 8 f !*' 6 £ b h f * (53) 
i dar* 

We proceed to transform these equations for the sake of showing 
their invariant structure with reference to any change of co- 
ordinates. For this purpose we introduce Bianchi's conception 
of the derivative vector of a vector £ given as a function of the 
points of a line (Chapter V, p. 139). 

If B is the line, the contravariant components of the vector 
D\i the derivative of are given by the equations 

(D%y = d -f + { ih, r } #e (r = 1, 2, . . . n). (54) 
oar i 
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For co-ordinates which are Cartesian either rigorously, or else 
locally along the line B, we have simply 

{m = %• 

With the help of the vector § > it is easy to give an explicit expres- 
sion for the elongation A, without making any hypothesis as to 
its order of magnitude, the equations (50) and (50'), which we 
shall use, being exact. 

On g we have identically 

n 

E fr a,,i,a? r 1. 
1 

/ds\ 2 

If we multiply on both sides by (^) > an d replace (in the a,/s) 

ds 

the #/s by their values (50) and the xj& by (50'), we find at 

da 

once (neglecting terms of the second order with respect to f ) 

We have, moreover, the well-known identities (Chapter V, p. Ill) 
??* - 2, [a /u {«, h} + a hk {il, h}]. 

/ds\ 2 

If in the last sum of the expression for ( ) we replace the indices 

r, h by k, I and take account of the identity just written down, 
replacing also, as in (52), 

n . n n 

°/r ^ a /» ^ Q hh <j> k9 

111 

^ &A» 

we find, on account of (54), 
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The vector D% is infinitely small at the same time as the g i9 a 
and their derivatives. We have, therefore, by extracting the 
root, neglecting terms of the second order, and attending to the 
definition (51 ) of the elongation A, 

A = S r (Z>5r&„ .... (51') 
1 

which shows its infinitesimal character. 

Naturally D% admits in its turn a derivative vector D 2 %. Its 
contra variant components are defined, from (54), by 

(Z)2?) r = d(B$r + ^ { kh r y b , {Dl)l 

Introducing on the right-hand side the expressions (54) for (DQf 
and {D%) 1 (after making some literal changes in the indices), we 
obtain 

+ S„(*J, '}P d £ + ^{U, r) (i*. l}bfb<e, (55) 

the second members, like the first, constituting a contravariant 
system. Bringing out explicitly the differentiation with respect 
to cr, and making some changes of indices, we can write 

(D%) r = d ^ + 2 S y , {jh, r)b» d -^- -f S« (55') 
where we put for brevity 

i 9x A i dor 

+ Z Ml {lh, r}{ik, l}b*b h e. 

We may note in passing that, in the auxiliary quantities S (r) , 
the index r is purely ordinal: we have placed it above, but in 
brackets, so as to avoid the suggestion that the S (r), s form 
a contravariant system, which they do not. 
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For our purpose it is sufficient to replace in H (r) the derivatives 

da 



— = by their values 



-^{ihjjb* b\ 

as given by the equations for geodesies (Chapter V, p. 134), so that 
we may write 



3<" = S^-^-'W** 



+ Sou [{», r} ?} - {Zfc, r) {**, I}] 6' 6 ft £*. (56) 

1 

If then we add S (r) to the two members of equations (53), attend- 
ing to (55') and to the definition of Biemann's symbols (§ 2), 
the equations take the invariant form 

imy-*±b r = -^{irMW&e (r=l,2,...n). (57) 
da i 



16. Geodesic deviation. Specification of the differential system. 
First integral. Linear relation in finite terms. 

The system (57), taken along with the valun of A given in 
(51'), contains n + 1 equations, with the same number of un- 
knowns; but it is easy to foresee, from the method of obtaining 
it, that it cannot by itself determine completely all the unknowns; 
there ought to remain an element of indeterminateness arising 
from the arbitrary nature of the law of correspondence between 
the points P of B and M of g. More definitely, we can prove 
that the definition (51') of A, or rather the relation derived from 
it by differentiation 




. . (51") 



is a consequence of equations (57) themselves. To establish this, 
note first that, for any vector v whose contravariant components 
are v r . we have 

d » r , * dv* . » r db r 
da % i aa i aa 
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db 

But the derivatives r of the moments of a geodesic B satisfy 
da 

the equations (Chapter V, p. 134 and p. 139 (52') ) 

which express, we may say, the autoparallelism of the geodesic 
B in terms of its moments b r . The preceding identity, after inter- 
change of the indices h and r in the last sum, therefore takes the 
form 

4- S r v r b r = 2, (DyY b r . ... (58) 
da i i 

In virtue of this equation, the first member of (51") now becomes 
(if we replace the vector v by D%) 

aa i 

That this expression vanishes we can easily prove from equation 
(57), making use of the properties of Riemann's symbols, as 
follows. Multiply both sides of (57) by b r and sum with respect 

n 

to r, noting that S r 6 r 6 r — 1. The right-hand member can be 
i 

written n 

-2 thkrp {ir,hk}a jv b»b<b h e. 



We now sum with respect to r 9 thus changing the Riemann 
symbol to the s3 r mbol of the first kind, and then make use of the 
antisymmetry of (ip, hie) in i and p (p. 179), from which it follows 
that the sum is zero. 

Equation (51') is therefore simply a particular integral (or 
invariant relation) of the system (57); its role reduces to that 
of fixing one of the constants of integration. As the system (57) 
contains n + 1 unknowns, to make it determinate it is necessary 
to associate with it some other condition — a circumstance easy 
to understand from the geometrical point of view, since we have 
still to fix the law of punctual correspondence between g and the 
base, 
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From a formal point of view the easiest way to complete 
the system (57) is to cut out the unknown A by putting 

da 

The equations (57) are thus reduced to the normal form (Chapter 
III, p. 36) 

[D%Y = - Va* hk}V&e (r = 1, 2, . . . «); (I) 

1 

and we see that, on account of the identity (51"), the system (I) 
admits the first integral 

£, (D%) r b r = A = constant, . . . (II) 
i 

expressing the fact that there is a constant linear dilatation 
when we pass from any arc of B to the corresponding arc of g. 

Since, on account of the identity (58), the first member of 

a 

the integral (II) is simply the derivative of f ' b n it follows that 

i 

every solution of the differential system (I) gives also 

i,€'b r = AaH C, . . . . (Ill) 

where C is a second constant. 

If in particular we take A — 0, we see that we can associate 
with the differential system the relation 

£ r £'A - c. 

i 

This gives the translation into analysis of the obvious geometrical 
fact that we can assign the correspondence between the points 
M of g and P of B in such a way that the (infinitely small) vector 
PM will have its orthogonal projection upon the tangent t to 
the base at the point P equal to a constant C. Such a law of 
correspondence implies, in virtue of (II), that there is no altera- 
tion of length (A == 0) as between the arcs of B and the corre- 
sponding arcs of g m To particularize still further, if (7=0, 
we arrive at the orthogonal law of correspondence (PM perpen- 
dicular to B) considered in § 13. 

( D 035 ) * 8 # 
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It is scarcely necessary to add that, in order to substitute 
other geometrical laws of correspondence, we have only to 
associate with the system (57) the analytical translation of the 

dX 

law chosen, instead of the law - - = 0. For example, if we wish 

da 

PM to be inclined to B at an angle if* (constant or a given func- 
tion of a) the additional equation will be 

n 

E r £ r 6 r ~_ £ cost/r, 
1 

where £ = \J £ t „ a ik £ l 

I i 

represents the length of the vector I*. 

In a case like this, some slight supplementary discussion of 
the complete system will be needed — its reduction to the normal 
form, determination of the number of constants of integra- 
tion, &c. 

17. Reduced form of the differential system (I) in terms of 
the co-ordinates y. 

We now return to the co-ordinates y, and fix definitely on 
the orthogonal law of correspondence between the base B and 
any geodesic in its neighbourhood. As we have just seen, such 
a correspondence is expressed analytically by the differential 
system (1), with the specifications A C 0 of the constants 
of integration connected with (11) and (III). 

As remarked in § 14, the co-ordinate y n of M is identical with 
that of P. Since the other co-ordinates y a (a = 1, 2, . . . n — 1) 
of P are 0, the variations 77* of the co-ordinates y are respectively 

r) a = V* (a = 1, 2, ...» - 1), if = 0, 

thus justifying the name of Cartesian components of the normal 
displacement or deviation PM = yj which we give to the y a 's. 

Moreover, the parameters 6* = of the base B vanish 

da 

for i = 1, 2, ... n — 1; and b n =•- 1. Christoffel's symbols also 
vanish along B, as well as their first derivatives with respect to 
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a (or, what comes to the same thing, to y n ), and consequently 
Equations (I) thus become 

where, in both sums, we have suppressed the term corresponding 
to the value n of the index, since every Riemann's symbol 
which has its last indices equal vanishes (p. 177). 

The first (/roup (!') (comprising n — 1 linear equations of 
tin 1 second order) defines the n — 1 Cartesian components of the 
(normal) deviation PM. The last equation reduces to an identity, 
as may be seen as follows. Riemann's symbols of the second 
kind are in all cases connected to those of the first kind by the 
relations n 

(ijjik) = 2> r a jr {ir 9 hk}; 

we have, moreover, for the symbols of the first kind (§ 4), 
(ij, hk) « — (ji, hk). 

In our case the coefficients a fl of ds 2 reduce, on B, to 0 (for r 4= j) 
and to 1 (for r - j). We have therefore, on the base, equality 
between symbols of the two kinds whose indices are the same; 
and, in particular, 

{nn, np} = (nn, np) = 0. Q.E.D. 

Of course, the integral relations (11) and (III) are of no further 
account, being now identities on account of the vanishing of the 
b"& (i = 1, 2, . . . n — 1) and of the nth component £ n — rj n of 
the displacement PM. 

18. Case of n = 2 — formula of Jacobi. 

For n — 2, that is to say for an ordinary surface, if B is 
the geodesic base, y 2 — the arc cr, and y x (= y) the normal 
distance from M to B, the system (1') reduces to the single 
equation , 9 

2- -<«.»>* 
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Now (§ 9) for any co-ordinates whatever, the Gaussian curvature 
K of a V z is expressed by the ratio *• 

(12, 12) _ (21, 21) 



a denoting the discriminant of the ds z of Y 2 - 

For our co-ordinates, which are Cartesian along B, a — 1, 
and Riemann's symbols of the second kind are (§17) the same as 
their homologues of the first kind. The equation defining y is 
therefore none other than the equation of Jacobi (§ 13) 

2 + « - 



CHAPTER VITI 

Relations between two Different Metric* referred to 
the same Parameters; Manifolds of Constant Curvature 

1. Differences between Christoffel's symbols relative to two 
different metrics assigned to the same analytical manifold. 

We introduced in Chapter IV notions of tensor, cxwariance, 
&c, relative to an analytical manifold V n9 i.e. to the aggregate 
of n variables x l9 x 2 , . . . x u \ we then, in the third part of the 
following chapter, considered the metrical manifolds obtained 
by associating with an analytical manifold V n a specified (but 
arbitrary) positive and definite differential quadratic form. 

There is clearly no reason against assigning in turn to the 
same analytical manifold two distinct metrical determinations, 
defined by the two quadratic forms 1 

« 

ds 2 = ^a^dx^dx^ . . . . (1) 
i 

/' 

ds' 2 — a u . dx t dx k . . . . (1') 
i 

*As a geometrical interpretation, we can think of two distinct V n '& whose 
points are in one-to-one correspondence, ho that a set of n values assigned to 
afj» x* . . , X* can be represented either by a point P of one, or by the corresponding 
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From each of these forma we can obtain a set of Christoffers 
symbols, which we shalf denote by 

{ih, r) and {ih, r}' 

respectively, and from these we can construct Riemann's symbols 

{ ir 9 hk } and { ir, hk]\ 

and the analogous symbols of the first kind. In this chapter we 
propose to find the relations between the symbols relative to the 
two metrics, and then to apply the results to geometrical con- 
siderations. 

We shall begin by forming the differences 

{t», r}' - {ihr} = p fc ... (2) 

we shall justify the positions of the indices on the right by showing 
that the p's constitute a tensor covariant with respect to i and 
h and con Ira variant with respect to r. 

To prove this, consider an arbitrary contra variant system 
(j r whose elements are functions of position, and a system (also 
arbitrary) of increments dx h of the independent variables. We 
know (cf. Chapter V, p. 138) that the expressions of the type 

i 

constitute a contra variant system. The same result is of course 
true for the analogous expressions corresponding to ds'* 

1 

and also for the differences 

r *r_ T r = 7 ± ihp r h £i dx ^ 
1 

The fact that this expression is contravariant means that, 
denoting by u t . an arbitrary covariant simple system, the expres- 
sion 

1 1 

point P' of the other. E.g. a map and the Burface of the earth are two V^'s with 
different metrics (one is Euclidean, the other not), and to every pair of values, <p 
(for the latitude), X (for the longitude), correspond one point on the map and one 
point on the earth. 
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is an invariant; and if we examine the right-hand side of this 
equality we see that its invariance requires that p r ih should be 
a tensor of the kind stated. 

It will be convenient for subsequent purposes to introduce 
also the associated covariant system 

n 

Pihj ~ ^r a jrPih (2') 

2. Differences between the covariant derivatives. 

Given a generic tensor A^, where as usual (?') and (h) denote 
the aggregates of m indices i x . . . i m and \x indices h x . . . respec- 
tively, we can consider its covariant derivatives with reference 
to either the first or the second fundamental form, i.e. with 
respect to either ds 2 or ds' 2 . A generic element of the system 
obtained by differentiation relative to the first form will be 
denoted as usual by <A { '^ k , and the analogous expression relative 

to the second by (A { ^)' r We wish to evaluate the difference 

To find it we can use the explicit expression (p. 146, 
formula (4) ) for the covariant derivatives of a generic mixed 
system. These are linear in Christoffers symbols, so that the 
differences in question will be linear in the p's; the expression 
for them will be 



f j<*V - E £ n' A (h) 



(3) 



These general formulae can also be obtained, without using 
any special m&mwia technica, from the original definition of 
covariant differentiation, with respect to a given fundamental 
form, of a generic tensor. It is therefore well to remind the reader 
that, for an arbitrary displacement dx h we assigned to the 
symbol d, when prefixed to a function of position, the usual 
meaning of the infinitesimal increment (the differential) caused 
by the displacement (cf. Chapter VI, p. 145); while for a generic 
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vector ^ an d its contravariant components f we assumed 

de = -S.-^a.r}^^; ... (4) 

1 

i.e. we defined the expressions dg r as the increments dependent 

on parallelism. All of this referred to the metric (1), which was 

then supposed fixed once and for all. We can of course follow 

the same procedure taking (1') as the fundamental form; but to 

avoid ambiguity it will be well to denote by d r the increments 

of the £ r, s due to the same displacement as before, so that we 

shall have n 

dT =~- -~V lh {iKrYVdx h . . . . (4') 
i 

It will also be useful to introduce the operator 
d* d' - d. 

Since for functions of position d' and d have the same meaning, 
we have 

d*f^ 0 

for any function of position /; for the contravariant components 
of a generic vector !• we have, subtracting (4) from (4'), 

d*r = - £ (A pt A £> dx h (r = 1,2,... n). . (5) 

1 

Similarly, given the covariant components u h of some other 
vector u, we find that 

n 

d*u h = Ejtp^ujdzt (5') 

Now, in order to prove (3), we need only consider the invariant 
multilinear form F whose coefficients are the elements of the 
given tensor A ( *y we know that the covariant derivatives A { '^ k , 
{A^) k are merely the coefficients of dF and d'F respectively. 
If now we take the identity 

d'F — dF = d*F, (6) 

and apply the operator d m on the right, using the property 
d*f = 0 for any function of position/, and (5), (5'), then 
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equating coefficients of like terms on each side we get 
formula (3). 

As a very simple application of this process, we shall deter- 
mine directly the values of the differences (A r ) k — A r \ k of 
the derivatives of a covariant simple system A r . We start 
from the invariant form 

F i r A,r (7) 

L 

and consider the usual generic displacement, determined by the 
increments dx t of the independent variables. We shall have, 
with reference to ds 2 , 

dF - Z,. k A r{k rdx k ; .... (8) 
and with reference to ds' 2 , 

d'F = $. rk {A r )' k rdx k (8') 

1 

Further, applying the operator d* to F, and remembering 
that d m A r is zero, and that is given by (5), we get 

d*F = -z lih A rP : h e dx h . 

i 

The identity (6) therefore takes the form 

n n 

Xrk[(AX — A r > /C ~]g r dx /C — 1> tih A v p T ih € l dx h . 
i i 

Replacing on the right h by k y and interchanging i and r, 
we get the typical term on the right also in a form involving 
£ r dx k \ hence, equating coefficients, we have 

{A r )' k -A rXk -r - k P i k Ai. ... (9) 

This is the particular case of (3) which we shall require in 
the next section. 

3. Differences between Riemann's symbols. 

We propose in this section to calculate the differences 
Ri hk = hky — {ir, hfc\, 
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which, being differences of two like tensors, are by definition 
tensors of the kind indicated. The calculations could be effected 
directly on the expressions defining Riemann's symbols (Chapter 
VII, p. 175), but the long formal expansion can be avoided by 
the following method. 

Let A r be any covariant simple system, and g r any contra- 
variant simple system, and consider the invariant form (7). 
Applying to it the operator 1 A 8d — d$ with reference to 
ds 2 (cf. Chapter VII, p. 173), and remembering the fundamental 
properties of this operator, we shall get 

AF =- S,4,Ar. 

1 

or, expanding A£ r by formula (4) of Chapter VII, 

AF = - i ir/lk {tr, hk} tA r Jx, t 8x,. 

Similarly, with reference to ds' 2 , we can introduce the operator 
A' 8'd' - d'V, and write 

A'F == - ^{i^hky^A^x,,^, 

and subtracting the former equation from this we get 

(A' - A)F = - R:„ k {' A r dx h Sx k . . (10) 

1 

We shall now obtain by another method the expression for 
the same quantity as a quadrilinear form in the quantities 
f 1 A. r dx h 8x k , and hence, equating corresponding coefficients of 
the two forms, we shall find the expression for R^. 

Note first that 

A' - A (8'd' — d'8 f ) - (8d - d8) 
« (8'd' - 8d) - (d f 8 f — d8). 

Since the second expression in brackets is obtained from the first 
by interchanging d and 8, we need only calculate the expression 

lr To avoid ambiguity we have here replaced the symbols 5 and 5\ used in 
Chapter VII to denote two distinct systems of increments, bv d and 0 
respectively. 
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for 8'd' — 8d. Fisher, in making this calculation we can ignore 
all those terms which remain unchanged on interchanging d and 
8, since they will disappear when we take the difference; we shall 
denote them collectively by X(d, 8). 
Introducing the notation 

d'— d = rf* 8' - 8 = 8* 

we have 

S'rf' = (8 + 8*)<Z' = M'+ 8*d' 
= 8(d + d*) + 8*d' 
= Srf + 8d* + S*d', 

so that 

S'd'- Sd — S<2* + S*rZ\ 
We therefore get 

(A' - A)F = 8d*F + 8*d'F - (dS*F + d*8'F). 

To calculate the first term we first apply the operator d* 
to the form F, remembering d*A, = 0 and (5). We get 

d*F =- 2,J, 
i 

= — 2 lhy A r p r tll i;' dx h . 
1 

Applying the operator 8 to this form we get, from the definition 
of covariant differentiation, 

8d*F - - £ lh ,AAp'nX£ l dx h 8x k 
i 

n 7i 

~ - — S l7trfr /4 I fr f 4 8^ — ^irhk p r ihAr\k£ % dx h 8x k . 
1 1 

Observing that the second sum can be written in the form 

n 7i 

^rk^r\k %Xk ^>ih Pih € ' dx h> 
1 1 

we have, applying (5), 
8d*F = - ktokR^k^ArdxiBz* + £ rk A, {k 8x Jc d*£\ (11) 
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To calculate the second term 8*d'F, we apply the operator 
8* to (8'), and get 

8*d'F = 2 rk (A r )' k e8*dx k + X rk (A r )' k 8*£ r dx k . 

1 1 

In the second sum, we can substitute for (A r )' k the expression 
given by (9), and we get 

8*d'F = Z rt (A r )' k F8*dx k 

1 

+ X rk A,. lk 8* £*dx k - h^pi.A^edx,. (12) 

1 1 

We must now add (11) and (12). In doing tins, we notice 
that the first sum in (1 2) is symmetrical in d and 8, since expanding 
8*dx k by (5) it can be written as 

n 

— 2f*a {A r )'k i r Pih d*i 8x h> 
i 

while the second sum in (12) and the second in (11) change one 
into the other if we interchange d and 8, so that their sum is 
symmetrical. There remains therefore 

8d*F + 8*d'F 

- X(d, 8) - A r dx h 8x h - E^pi.A^rdx,. 

I 1 

In the last term we substitute for S # £ r its explicit expression, 
so that it becomes 

71 

— ^irkih P\k p\h i l A § X h dx k . 

1 

In order to be able to collect the terms in the two sums with 
the common factor f * A r dx h Sx k , we apply the substitution for- 

) to the indices in the last sum, so getting 
I i r k h/ 

8d*F + 8*d'F = X(d, 8) - h,Mn\k ~ Si pipU 

i i 

The expression obtained by interchanging d and 8 on the 
right (remembering the definition of X) is 

X(d, 8) - X M [>i „ - S, pt h fa pArSzt dx k . 
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Interchanging h and k in the second of these, and subtracting, 
we get finally 

(A' - A)F 

n n 
= — ^ikrk [Pih | * — P r ik\h — ^l (Plh P l ik ~ Pifc Pih)1 £ 1 A r dx h 8x k . 

Comparing this with (10) we get the required formula: 
Rkk - \ir,hk\'- {ir,hk\ 

n 

= Pili\k — Pik 1 h — (/>ft ft* — PtoPih)> • (I 3 ) 

which expresses the differences between Riemann's symbols of 
the second kind in terms of the differences between Christoffel's 
symbols. The analogy between this formula and that defining 
Riemann's symbols (p. 175, formula (3) ) should be noted. 

If we contract (13) by multiplying by a Jr and summing with 
respect to r, and then use the formula 

n 

Pihj\k ~ a jrP\h\k 
1 

obtained by covariant differentiation of (2') with respect to ds 2 , 
we get the covariant system 

n n 

^r a j> Km = P,I>j\k — Pikj\h — (PlIijP l tk — PlkjPUi)' 

It is to be noted that this does not give the differences between 
Riemann's symbols of the first kind. In fact, substituting on the 
left the expression for R r lhk , (14) becomes 

n 

H r aj r {ir 9 hk\'— (ij, hk) 

= Pihj\k — Pikj\h — S £ {pihjpik — PikjPih)> (14 r ) 

and the first sum is not the same as (ij, hk)\ which would be 

n 

4. Case of two metrics in conformal representation. 

We shall now apply the formula (14) to the case in which the 
two fundamental forms (1) and (!') differ only by a factor. As 
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both forms are positive, this factor must also be positive, so 
that we can denote it by e 2r ; we shall therefore suppose that 



The geometrical interpretation of this condition is quite 
simple, namely, that the correspondence between the two mani- 
folds is such that infinitesimal segments are proportional, or 
there is similarity of infinitesimal parts. It follows that the 
angle between two curves (the angle between their tangents 
at the point of intersection of two infinitesimal elements) is equal 
to the angle between the corresponding curves; hence the name 
of conformal representation. 

In order to calculate (14), we shall obtain in turn, first, 
ChristofTel's symbols of the first kind for the two forms, then 
those of the second kind, from which we shall get the p^'s, 
and lastly the p l/y 's and their derivatives. 

We start from the relations equivalent to (15) 



or 



ds'* = e^ds*, 
d$' = e T ds. 



(15) 



a lh ^ e iT a ik (i, h = 1, 2, ... w) 
and shall calculate the symbol [ih, /]'. We get 



(15') 




. f dr . 3t 9t~1 

+e2 K% +% a*~ a »^J 



= e* T ( I] + a u r h + a ih r t — a Ui r t ) 9 



where r h stands for T , &c. 

To construct the symbols of the second kind, or 



n 



{ih,r}' = S,a^[zA,Z]', 



we observe that the coefficients a" 1 are, by definition, expressible 
as the quotient of a determinant A' fl of order n — * 1 (the comple- 
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mentary minor of a rl in the determinant |[ a tl || ) by a determinant 
a' (namely, || a rl \\ ) of order n. Remembering (15'), we see that 
in these determinants we can take out a factor e 2r which is 
common to every element, so that we can write 

A' a - A Ht a' e 2 ™a, 

where A rl and a denote the determinants corresponding to A' rl 
and a', but relative to the coefficients a rl . We thus have 

a"* - <r 2r a", 

and therefore 

{ih, r}' = 2,a ,z ( [*A f /] + a tf t a + a z/l r t — a t7l r,) 
i 

- { ih, r ) + 8 f r r /t + 8J r,- — a lh r r , 

where the S's as usual denote a factor which is 0 or 1 according 
as the indices are the same or different. 

The difference [ih, r}' — {ih, r) is therefore given by 

P r * =- 8rT A + 8^T t -a (ft r'- . . . (16) 

Multiplying this by a jr and summing with respect to r, we 
get (using (2') ) 

Pihj = <t><jT h + a h) T L — a ih r r . . . (16') 
By covariant differentiation with respect to ds 2 we get 

Pihj\k — a () T hk + a hj r ik ~~ a ih T jk\ 

subtracting from this formula the analogous one obtained by 
interchanging h and k (so as to form the first part of (14) ), and 
remembering that r, ik — r kh , we find that 

pihj\k — Ptkj\h — a hj T ik — a k) T ih ~ a ih T jk + a ik T jh- (17) 

The second part of (14) can be constructed with the help of 
(16) and (16'). We shall first calculate 

w n 

%iPihjP l ik = s « ( a ij T h + ahjTi — <*ihTj) (^T/f + 8*t< — a ik r l ) 
X 1 

— a hj At — a ih Tj r k — a kh Tj t x + a ik r 3 r h , 
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n 

where At — r t r l (the first differential parameter). The third 
1 

term and the last term cancel out. Denoting by X the aggregate 
of the four terms underlined, which are unchanged if we inter- 
change the indices h and k, we can write 

n 

2* Pihj p\k = — a ik <*hj At + a hj r k t { — a Ul r J r lc + X . 

We shall now subtract from this the formula obtained by 
interchanging h and /c. We shall get 

u 

S« (Pt,,j pl k — Pikj p\h) = ( a <h a jk — <*ik tyh) At 

+ a hj T k T i ~ a kj T h Ti — a ih TjT k + a ik Tj T tl . 

Using this and (17), we get the right-hand side of (14') in the 
form 

— a ih (jjk — TjTk) + a* ( T jh — TjT h ) + a )h (r ik — T, T k ) 

— a>jk(7ih ~ T t r h ) — — At. 

The left-hand side of (14'), using (15'), can be written as 

e-** S, a jr {ir, hk\' - (ij, hk) 
i 

or e- 2T (ij, hk)' - {ij, hk). 

Finally, formula (14'), for two metrics in conformal repre- 
sentation, can be written in the form 

e ' " T (ij, hk)' — (ij, hk) — a lh (r jk — r, r k ) + a tk (r Jfl — r 3 r h ) j 

+ <fy, ( T ik — Ti n) — a jk ( T ih — t* T h) — (a<ih a jk — a tk a jh) At. J 

This formula was found by Finzi, by another method, as early 
as 1903. 1 

] Cf. "Tje ipersuperficie a tre dimension! cho ri poxaono rappresentare eonforme- 
mente sullo Npazio euelideo", in Atti del It. hi. Vcncto, Vol. IjXII, pp. 1049-1002. 
The later researches of Finzi and Schouten on the manifolds of any number of 
dimensions which can be conformably represented in a Kuclidean space of the same 
number of dimensions, should also be mentioned. Cf. Rend, dclla R. Aca. dei 
Lincei, Vol. XXX (first half -year, 1922), pp. 8-12, and Vol. XXXI (first half- 
year, 1923), pp. 215 218, and Schouten's book cited at Chap. VII, p. 172. Cf. also 
ft. J. Stuuik: Qrundzuge der mehrdimensioimlen Different ialf/eomet7'ie (Berlin, 
Springer, 1922), Ch. IV, § 13, p. 1 50, where Schou ten's results are given with 
bibliographical uotes. 
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This formula can be given a simpler form by putting 

u = e~ T , 

so that (15) becomes 7 

ds'* = -i ds*. 
u 2 

We then have 

Uf = — C- T T„ 

e- T (r tk — TiT k ), 

from which we get 

T = — Ui e T 9 r ik — TiT k = — -L* 

n 11 Aw 

At = S^a^TiTjt = e^^a^UcUj, = tP* Au = - 
i i 

Thus (18) becomes 

u*(ij,hk)' - (ij,hk) 

U ik U ih U ik i u ih / vAw /10 » 

= aUl u ~ aik u ~~~ ajh a ajk u ~ ^ ajk ~ aik ^ ' ( * 

At the end of this chapter we shall have occasion to point 
out an interesting geometrical application of this result. 

5. Isotropic manifolds. 

Leaving aside for a moment this order of ideas, we propose 
to study those F n 's in which the Riemannian curvature, as 
defined in Chapter VII, pp. 195-198, does not depend on the 
section, but only (if it is variable) on the point. This is expressed 
analytically by the fact that the expression for K given by (31) 
of the preceding chapter is independent of the u's and the w's. 
We shall see that these F,/s, which we shall call isotropic, i.e. 
with (locally) constant curvature, are characterized by a parti- 
cularly simple expression for Riemann's symbols. 

We observe first that a fairly simple algebraic combination 
of the coefficients a ik , which possesses the fundamental properties 
of Riemann's symbols, is the following: 

where y is a priori any function whatever of position. Everything 
reduces to proving that when these quantities are substituted 
for the symbols (y, hk) in (31) of the preceding chapter, the 
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resulting value of K is independent of a and v. In fact, making 
the substitution, we have 

K = 2,,/* (a* — a^a^) u i v j u h v k 

sm 2 a 1 

sm a L 1 1 1 1 J 

and since n n 



1 

n 



^ik a ik ui ^ = 2 jA a, A v ; w* = cosa, 
1 1 

it follows that 

K = - V [1 - cos 2 a] =r= v. 
sm 2 a 

Hence the Riemannian curvature of a V n whose Riemann's 
symbols are the expressions b Qt hln is y, and is therefore indepen- 
dent of the section. But we can also show that this is the most 
general expression of Riemann's symbols which will make K — y. 
In fact, if we put 

(ij, hk) — b ijt hk + B %j hky 

where Mc has the meaning assigned to it above, we shall show 
that B,j hk — 0. To do this, we insert this expression for (ij, hk) 
in (31); the right-hand side can then be broken up into two parts, 
the first of which, containing the symbols b tj hJc9 is, as we have 
seen, equal to y, and the second, which is 



Billed 1 

must vanish if we are to have K — y. 

The sum just written can be simplified if we observe that 
since Riemann's symbols are antisymmetrical, the two terms 

B, Jt W «M«¥ 
can be collected into a single term; putting 

thiB term becomes 
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Thus the sum for all the permutations hk becomes merely a 
sum for all the simple combinations (h 9 k) of unlike indices, since 
it is useless to consider terms with a repeated index (k — h) 9 
because p hh = 0. We shall denote the sum extended only to 

n n 

simple combinations of the indices by B hk instead of £ /<Ar . The 

1 i 

quadruple sum thus becomes 

u n 

2„ rtv J S hk B ljthk p hk . 
i i 

Proceeding in the same way for the indices i, j, we get ulti- 
mately 

n n 
11 

i.e. an expression bilinear in the j/s. Each summation will extend 
to m — \n(n — 1) pairs; we shall number these (in any order) 
from 1 to m, and put 

Pij = z p, Phk — z y y B lJthk = B^ )Ay) 
(i,j, h,k=^ 1, 2, . . . n; y = 1,2,... w), 

where /? is the ordinal number of the pair ij and y that of hk, 
so that the sum can be written 

^Py ^03), (?) z fi V 

It will now be clear that this expression cannot vanish for 
arbitrary values of the s's, unless all the B's are zero; which is 
precisely what we wished to prove. 

We may therefore conclude that for a V n whose curvature 
is locally constant (i.e. independent of the section) and equal to 
a given function of position K, Riemann's symbols are neces- 
sarily given by the formula 

(ij, hk) = Kia^a^ — a ik a Jh ). . . (19) 

Multiplying by a Jr and summing with respect to j, we get 
the expression for the symbols of the second kind: 

{ir,hk} = Kia^Sl-aM. . . (19') 
The function K, however, cannot be arbitrarily assigned; 
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we shall show in the following section that f or n > 2 it must be 
a constant. 

6. Schur's theorem. 

This theorem states that if the curvature is locally constant, 
it is also the same at all points. The case n — 2 is not considered, 
as there is only one section at each point, so that we cannot 
properly speak of locally constant curvature. 

We shall therefore show that the K of formula (19) is constant, 
or that Ri = Q (I = lf 2 , . . . n), 

where K t represents a generic covariant derivative, identical 
(cf. Chapter VI, p. 147) with the ordinary derivative. 

To prove this, we take the covariant derivative of (19), 
remembering Ricci's lemma. This gives 

(ij, hh) t = K t (a tk a Jk — a ik a jh ). 

Taking three distinct values for k, I (which is' possible, 
since n > 2), the other two relations obtained from this one 
by cyclic permutation of h, Jc, I can be written in the form 

(y> M) h = K h (a ik a jt — a u a jk ), 
(ij, lh) k = K k (a u a jh — a £h a jt ). 

Adding the terms on the left and on the right of these three 
equations, and remembering Uianchi's identity (Chapter VII, 
p. 183), we get 

0 = K t (a ih a /k — a ik a jh ) + K h (a ik a }l — a u a Jk ) 

+ &h i*u*jh — ( 2 °) 

By varying i and j, we thus get \n(n — 1 ) relations, of which 
we shall now make a suitable linear combination. Multiplying 
(20) by a ih a Jh and summing with respect to i 9 j, we find that the 
coefficients of K h K h , K k , are all of the type 

1 

where a, j8 denote two of the indices h, k> I; or, making the two 
summations in turn, of the tj-po 
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where the S's as usual denote either 0 or 1. These quantities are 
therefore always zero, unless we have simultaneously a = h y 
fi = k (which happens in the coefficient of the first term), in 
which case the value is 1. Our linear combination of the equations 
(20) thus reduces to 

K t = 0. Q.E.D. 



7. Canonical form of ds 2 for a manifold of constant curvature. 

Given a Euclidean space S a , we propose to find, if it exists, 
a manifold V n with constant curvature K, which can be con- 
formally represented on S n , or in other words (cf. § 4) such that 
its linear element is given by 

ds 2 



where ds is the element of 8 n . We shall see that this is always 
possible, and the solution of the problem will lead us to assign 
two important forms for the ds 2 of a manifold with constant 
curvature. 

Keeping the notation of § 4, we shall have for Kiemann's 
symbols for the V' n the expression (cf. formula (19) ) 

(ij , A*)' K (a lh a' )k — a' k a jh ) - {a lh a Jk — a lk a jh ), 

14' 

and for S n 

(ij 9 hie) = 0, 

since for a Euclidean space all Kiemann's symbols are zero (cf. 
pp. 173-178). 

We must now substitute these values in the equations (18'); 
these constitute a system of differential equations the integration 
of which will give the function u. Making the substitution, (18') 
becomes 



u u u J u 



~ \ a ih a jk — a ik a jh) — -g = 0 

(i, j, h, k — 1, 2, . . . n) 



(21) 
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These T V^ 2 (n 2 — 1) equations can be satisfied by putting 
u ik — ca^ (i, h = 1, 2, . . . n), . . (22) 

where c is a constant; in fact, substituting these values, they 
take the form 

^ 2 (flih a ,ic — *a «,a) (2cw — K — Aw) 0 
(i, A; = 1, 2, . . . n); 

and in order that they may all be satisfied, we need only make 
the common factor vanish, i.e. put 

2cu — K — Au = 0 (22') 

We have therefore substituted for (21) the system composed 
of the equations (22) and (22'), which holds whatever may be 
the co-ordinates x. If then we suppose, as we always may, that 
the x's are orthogonal Cartesian co-ordinates of S ni so that 

a lk - S, A, - L„ ( a J , u ik == 



our system will take the simpler form 

cS?, (23) 



3 2 « R* 



dx ( dx k 



2CU-K- S, V = 0. 



(23') 



We shall examine separately the two cases c = 0, c 4= 0. 
If c = 0, the system becomes 

, - 0, (24) 

o x L a x, c 

K +^0^y= °> • • • < 24 ') 

and from the second of these it follows that K < 0. Such a 
solution is therefore possible only for manifolds of constant 
negative curvature, since we do not consider the case K = 0, 
which has no special interest, i.e. the case when V' n is itself Eucli- 
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dean. Equations (24) then give, by an immediate integration, 

u = lL v b v x v +6, .... (25) 
1 

where the b v 's, and b are constants and therefore, substituting 
in (24'), 

K+ SA 2 = 0 (25') 

1 

This shows that the 6„'s are not all zero, and that therefore, 
by applying an orthogonal substitution to the co-ordinates, 1 
(25) can be put in the form 

u == kx n (k constant) 
so that (25') becomes 

K + £ 2 - 0, 
or k — %/ — K. We therefore have 

u — \/ — K x M 



and therefore 



dx*+ dx?+ . . . + dx 2 
ds'* = - V- ... ?_3L_ T „» . . (26) 
-Kx* 



This is the canonical form of the line element of a manifold 
of constant negative curvature. It was found by Beltrami 2 in 1868 
by another method. 

Another type of solution which holds for any value of K 
whatever is obtained by supposing c #= 0. (23) gives us the two 
groups of equations 

a* a .k = °- < + *' ' • • • < 27) 



1 The hypcrsurfaces u — constant, i.e. E„ hv — constant, are a set of parallel 

hyperplanes ; wo need therefore only choose the axis x n in the direction perpen- 
dicular to them in order that their equations may take the form Xn = constant, 
and therefore that u = k xw 

"Operc matematiche, Vol. I, p. 419. Milan, Hoepli, 1902. 
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The first group has for its general integral 

u = SfX* (28) 

i 

where X { is a function of x { alone. 
The second group gives 

where differentiation is denoted without ambiguity by dashes, 
since the argument of X t is x t only. 
From this, integrating once, we get 

where the arbitrary constant of integration has been put in the 
form — cx° l9 using the hypothesis c + 0; and integrating a second 
time 

X t =- +6, 
where b s is a constant. Substituting from this in (28) and putting 

u 

b = 2 4 6 t we get the following expression for u: 
1 

u = ° a?) 2 +6, . . . (29) 

^ 1 

containing ra + 2 arbitrary constants. 

We still have to consider (23'); substituting in it this value 
of u, it becomes 2c& — K = 0 .... (23") 

and therefore merely establishes a relation between the two 
constants c and b. 

We have therefore obtained a solution containing n + 1 
arbitrary constants; we can choose these to satisfy specified 
conditions at a generic but fixed point O of S n . E.g. suppose we 
wish to take the .x°'s in such a way that all the w *s are zero at 
the origin. We have from (29) 

Uj = c (xj - aj); 

hence every must vanish, so that (29) becomes (substituting 
for b from (23") ) 

?•«■'+ 5 (30) 
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We can then determine c so that at the origin u « 1; for 
K 

this we must have c = — , and we thus get finally 

u = 1 + f S,x 4 ? .... (30') 

This value of u makes ds' 2 take the form given by Kiemann: 
%dx* dx?+ dx?+ . . . + dx 2 

We shall show farther on (§ 2, p. 246) that the ds 2 of any 
V n whatever of constant curvature K can be put in the form 
(31), and also if K < 0, in the form (26); this will justify the 
choice of the term canonical forms for these expressions. 

Here we shall also prove the almost obvious property that a 
hypersphere of radius R in Euclidean space of n + 1 dimensions 

constitutes a V n of constant positive curvature K = 3^. To 

do this we shall take y 0 , y L , y 2 , . . . y n to denote orthogonal 
Cartesian co-ordinates in S il+l , so that 

ds* - S.rfy* (32) 

o 

Without loss of generality we can consider only the hyper- 
sphere whidi has its centre at the origin, and is therefore repre- 
sented by the equation 

ky; 2 = R 2 (33) 

0 

We shall prove the assertion in the most direct way, by 
expressing the n + 1 co-ordinates y of the points of the hyper- 
sphere, connected by the relation (33), in terms of n suitable 
curvilinear co-ordinates sr, and showing that ds 2 takes the required 
canonical form (31) when these parametric expressions of the 
jy's in terms of the x's are substituted in (33). 

The parametric representation of the y*& referred to is an 
immediate generalization of that given for an ordinary spherical 
surface by stereographic projection. In this case (n = 2), if 
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we project from a point whose co-ordinates are y 0 ~ — R> 
y x =s y 2 = 0 upon the tangent plane at the diametrically 
opposite point, every point (y 0 , y lf y 2 ) of the sphere projects into 
a point on the plane whose co-ordinates x v x 2 are connected 
with the y's by the relations 1 



where 



u =1 + ?^, K = ]p p"-!,.,'. (35) 



For any value of n, we shall adopt the same formulae, with 
the obvious modification that v is to vary from 1 to n. This does 
in fact give a parametric representation of our hypersphere; 
for squaring and adding the equations (34), and substituting 

n 

for xfits expression in terms of u as given by (30'), namely, 
1 

4 

(u — 1), we get back to equation (33). We have then, 

K 

differentiating, 

, 1 2du 

dy„ - x "f ( V ^ l,2,...n). 

4 4 
Squaring and adding, and substituting (u— 1) and ~du 

for E lf a^and 2 S,, x v dx v respectively on the right-hand side, wc 

1 - 

get finally 

which is the required result. 

1 These relations can easily lie shown to be the same as those ordinarily used if 
we replace p, the radius vector of the projection, by the colatitude ^ of the point 

on the sphere. As by definition Vi* y 0 = cos&, it follows that - as cos 2 A&, 
p at 2R tan£S. 

(l>666) 
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CHAPTER IX 



Differential Quadratic Forms of Class Zero 
and Class One 



1. Forms of class zero (or Euclidean forms). 

In Chapter V, p. 123, we defined the class of a given V n (or 
of the quadratic form ds 2 which characterizes it) as the number 
N — n, where N is the minimum number of dimensions of a 
Euclidean space in which the V n can be immersed. 

We shall consequently say that a quadratic differential form 



is of class zero (or is Euclidean) if it is possible to substitute for 
the n variables x a set of n variables y (since N = n), connected 
with the x's by the relations 



Vv = y v ( x v x 2> • • • x n) (v = 1,2,... n), (2) 
and such that (1) assumes the Cartesian form 



Given (1) we wish to find a criterion which will enable us to 
recognize whether such a transformation is possible. We shall 
show that it is sufficient to construct Iiiemaim's symbols relative 
to (1), and to determine whether they vanish identically or not. 
We have already seen (Chapter VII, p. 178) that this condition 
is necessary; we wish to prove that if, inversely, all Riemann's 
symbols relative to (1) are identically zero, then (1) can be trans- 
formed into (1'); or in other words that the n functions (2) can 
be so determined as to satisfy the \n{n + 1) equations 



ds 2 = lk a ik dx l dx k 



a) 



i 



n 



(1') 



1,2, 



(3) 




(4) 
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By oovariant differentiation of (3), we get 

n 

0 = ^Ay,\uVu\k + y v \iV v \kiY 

By cyclic permutation of the indices i, k y Z, we get from this 
the two further equations 

0 ^ s v (y K |Ky,u + y v \ky r \u), 

71 

0 = EAy v \uty*\i + y v \iy v \ik)- 

Now add the last two of these equations and subtract the 
first. From the commutation rule (§ 6, p. 184), combined with 
the vanishing of Riemann's symbols, it follows that the second 
derivatives are permutable, so that we get 

n 

^ v y>.\ucy v \i = o. 

Keeping i and k fixed, and making I vary from 1 to n, this 
formula gives us n linear homogeneous equations in the n un- 
knowns y v \ ik (v= 1, 2, ,.. n). The determinant of the system 
is certainly not zero, since it is composed of the terms y x \ h 
i.e. is the functional determinant of the transformation (2); we 
therefore conclude that 

y,l* = 0 (v, h k = 1, 2, . . . n). . . (5) 

These equations, which we have deduced from (3), can be 
put in the form 

d ^ £ = %{ihj)y vU =/ H *<s|y,| lf . . . (5') 

dx A i 

in which we are concerned only to the extent of observing that 
the right-hand side is a known function of position and of the 
terms y v{j . 

It is now easy to see that the problem is reduced to that of a 
mixed system of total differential equations and equations in 
finite terms which we have already considered in § 8, p. 29. 

In fact, considering as unknowns the n quantities y v9 and the 
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n % quantities y v ^ y we can collect together the equations (4) and 
(5') into a system of total differential equations 



dy v = Zj c y vlk dx k9 



i 



n j 



(v, i = 1, 2, . . . n), (S) 



while the group (3) constitutes \n{n + 1) relations in finite 
terms between the n 2 + n — n (w + 1) unknowns. 

The conditions for complete integrability, by the usual rule, 
are as follows: 



(a) -^-"1* = dy ^ h 
dx h dx k 

(6) ^'l'* = d/i.|tt 
9% 3 a?* 



(y, i, h, k — 1,2,... w); 



(c) the equations obtained by differentiating the equa- 
tions (3) must be identically satisfied in virtue of the 
equations (S). 

Introducing the covariant derivatives and once more apply- 
ing the commutation rule (or the second derivatives, the con- 
ditions (a) can be written in the form 

Vv\kh — y v \hk — linear combinations of Riemann's symbols, 

and it will then at once be seen that they are satisfied identically, 
since the left-hand side vanishes in virtue of (5), and the right- 
hand side also vanishes, since by hypothesis Rieraann's symbols 
are zero. 

A similar argument holds for the conditions (6), which are 
equivalent to 

Vv\ikh — y v \ihk — linear combinations of Riema nil's symbols. 

Lastly, taking the covariant derivatives of (3), we find the 
conditions (c) in the form 

n 

a>a\i ^ ^u(y v \uy v \k+ y»\itfv\H)> 
i 

and it can at once be verified that all these are satisfied, in virtue 
of Ricci's lemma and the equations (5). 
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The mixed system is therefore complete, and it will be possible 
to find the functions (2), which will contain \n(n + 1) arbitrary- 
constants, this being the difference between the number of un- 
knowns and the number of equations in finite terms. In geo- 
metrical terms, if the manifold is Euclidean there are in it oo* n(n+1) 
(orthogonal) Cartesian systems. If we can find a particular 
solution 7) v i72> - • • Vn> we can g e t the most general solution by 
a substitution of the type 

n 

Vi = c, + Sjayty (* = 1, 2, . . . n), . (6) 

where the a's are the coefficients of an orthogonal substitution; 
i.e. are connected by the \n{n + 1) equations 

S ( a v a„ = 8; (j, v 1, 2, . . . n), . . (7) 

1 

while the c\s are n completely arbitrary constants. 

This can be immediately proved from the characteristic pro- 
perties of orthogonal substitutions. In fact, from equations (6), 
differentiating, squaring, and adding, we get 

n 

dy t ------- ^ja,jdrj jy 

n 

i 

n n 

23i(rfyJ a = ^a^a^dTjjdrj^ 
1 i 

summing the last of these with respect to i and using (7), we get 

n n n 

l i i 

The hypothesis that the t/'s are a particular solution of the 

u 

system is expressed algebraically by the equation S r drj u 2 ^= ds 2 ; 

i 

hence we can write 

S t (^)2 = d8*, 

1 

which proves that the y 9 n rdso constitute a solution. An easy 
calculation shows that the number of independent constants in 
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(6) is \n(n +1), and hence the solution so obtained is the most 
general. 

It is obvious that the equations (6) are a generalization 
of the formulae for changing the co-ordinate axes in ordinary 
analytical geometry. 

2. Conf ormal representation of a manifold of constant curvature 
on a Euclidean space. Mutual applicability of all F n 's with the 
same constant curvature. 

In the preceding chapter (p. 236) we solved the following prob- 
lem: given a Euclidean space S ll9 to find a manifold V' u of given 
constant curvature which can be conformally represented on S„. 
We now propose to prove that conversely, given a manifold V n 
of constant curvature, it is always possible to represent it con- 
formally on a Euclidean space S u . In other words, if ds 2 is the 
line element of a V n of constant curvature K, we wish to prove 
that a suitable function U -- = er T can be so chosen that 

ds'* - e lT ds 2 = A<fc 2 

is Euclidean. 

The necessary and sufficient condition for this is that the 
equations (18') of Chapter VIII, p. 232, should all be satisfied 
by putting ( . . My = 0 

(zj, hk) = K(a ih a Jk — a ik a jh ), 

and writing U instead of u. U must therefore satisfy the 
- 1 i 5jW 2 (w 2 — 1 ) equations 



[ yz — A J {a ih a jk — a ik a jh ) 

= a ih U 4 - a* - a jh r« + a 

Putting 



if **U~ J " U + * V 



U ik = a ik (aU+ fi), .... (8) 

where a and /? are two constants, and following the same method 
as that used in § 7 of Chapter VIII, p. 236, we see that these 
equations are satisfied provided ultimately 

+ * (9) 
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If we consider the equations (8) as defining all the derivatives 
of the n quantities U i9 then together with the identity 

dU = EU.dXi (8') 

1 

they constitute a total differential system in the n + 1 functions 
U i3 U; the equation in finite terms (9) is to be associated with 
it. It is easy to verify that we need only take a = — K in order 
that this mixed system may be completely integrable (cf . Chapter 
II, p. 29). ' 

In fact, the conditions of integrability of the equations (8) 
are expressed by the commutation formula? (§ 6, p. 184) 

U i]kl - U l]lk - - S r {ir, hi) U r9 . . (C) 

and those of the equations (8') by 

U ik = U kit 

These latter conditions are at once satisfied, on account of 
equations (8). The left-hand side of (C), also by (8), reduces 
to 

<*K* U t — a u U k ) y 

and the right-hand side, using the expression (19') of the pre- 
ceding chapter for Riemann's symbols for manifolds of constant 
curvature, becomes 

-K(a tk U t -a a U k ). 

The equations (C) therefore reduce to identities provided, as 
stated above, we take a — — K, 

Lastly, there is the equation in finite terms (9); putting 
a — — K, this becomes 

2, U J Uj = — KU* + 20U. . . . (9') 

Differentiating this, using formula (16') of p. 152, and taking 
out the factor 2, we get the conditions 

ijU'Ujt = -KUUi+fiUt, 

which are also identically satisfied in virtue of (8). 
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Remark. — Having thus seen that the system is completely 
integrable, we know (§ 8, pp. 29-33) that the solution contains 
n arbitrary constants which we can choose in such a way that 
at a specified (but perfectly general) point O of the manifold the 
n functions E/, : take values arbitrarily fixed in advance. Further, 
the constant /? is still at our disposal. 

We get a first class of solutions if w 7 e take /J = 0, which 
makes (8) into 

U* = -Ka ik V. 

The hypothesis £ — 0 is therefore admissible in the real 
field only when K < 0; in fact, for fi - = 0 the equation (9') 
reduces to 

AU = - KU*. 

In the real field the left-hand side is always essentially positive, 
excluding the case when the function U is a pure constant, or 
in other words (on account of equations (8), which now reduce 
to U ik = — A* a lk V) retaining the conditions If 4- 0. Since 
the right-hand side has the opposite sign to K, it follows that 
the equality is possible only if K < 0. 

In order to have a generally valid solution, wc must suppose 
f3 =4= 0. Wc shall then choose j8 and the n other constants so 
that 

JJ % = 0, U 1 (i ], 2, . . . n) 

at the point 0, so that from (9') we see that fi — , and U will 
be completely determined. 

With the notation of the present problem (i.e. using dashes to 
denote quantities relative to the Euclidean space) we proved in 
Chapter VIII, § 7, p. 236, that if a factor u exists such that the 
manifold for which 

has constant curvature K, and if the conditions u — 1, u t =- 0 

are satisfied at a specified point O (which may always be supposed 

taken as origin of Cartesian co-ordinates), then the expression 

for u is K n 

u= 1 + , S t x t ? 
4 i 
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Further, we have now found that the quantity i satisfies 

all these conditions (in fact ~ — 1 at O, ^1 ^ — — ~± % = 0 
at O), and therefore we must have 



" i + fw 

4 1 

An extremely important corollary can be deduced from the 
foregoing results. Given two n-dimensional manifolds with the 
same constant curvature K, both their ds 2 , as we have seen, 
can be reduced by suitable changes of variables to the same 
canonical form 

1 

where 

4 1 

It is therefore possible by a change of variables to transform 
one form into the other; or in other words, if two manifolds of 
the same number of dimensions satisfy the single condition of 
having the same constant curvature, then either can be conform- 
ally represented on the other. 

3. General remarks on hypersurfaces in Euclidean space. 
Second fundamental form. 

Let S ll + 1 be a Euclidean space and y v y 2> . . . y n + x a system 
of Cartesian co-ordinates in it, so that 

ds 2 = S„ dy v ? 
1 

Consider a hypersurface V n (frequently called merely a 
" surface " when there is no danger of ambiguity) immersed in 
£ n + 1 and defined by the parametric equations 

y v — VA*v x * • r • x n) (v = 1. 2, . . . n + 1). . (10) 

(D655) 9* 
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As usual, the functional matrix of these equations must have 
n as its characteristic (cf. p. 87). 

As an obvious extension of the ordinary case (n = 2) we shall 
first define the direction of which is normal to V n at any 
given point P. 

Let a, (i>~ 1, 2, denote the cosines of the direc- 

tion we are in search of, relative to the axes y (i.e. the parameters 
or moments, which are indistinguishable in a Euclidean space). 
These cosines will be connected by the usual quadratic identity 

n + l 

Z„a, a = 1 (11) 

1 

The geometrical property which we have to express is that 
the direction whose cosines are a, is perpendicular to any tangent 
to V n at P, or, which is the same thing, to any elementary dis- 
placement dP which is a tangent to V h and therefore (neglecting 
infinitesimals of higher order than the first) does not move out- 
side the surface. For every such displacement the equations 
(10) must still be satisfied, but the increments dx t of the sc's 
will be otherwise arbitrary. If dy v denotes the corresponding 
increments of the Cartesian co-ordinates y v > the a's must satisfy 
the equation 

%a„dy v - 0 (11') 

for every system of rfy's given by (10), i.e. by 

it 

dy v = E t y^dx l9 (12) 

with the dx's arbitrary. 

Substituting in (IT) we have 

n n + l 

i i 

and since the dec's are arbitrary this means that the a's must 
satisfy the n equations 

n+l 

S,«,y,|i = 0 (?== 1,2, ...«). . (12') 

1 

These equations, together with (11), determine the a's except 
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as to sign. The ambiguity of the sign is natural, as we are dealing 
with a direction and have made no hypothesis as to its sense. 
In what follows we shall suppose the sense fixed in advance as 
may be most convenient. 

We know that the metric of V n is defined by the quadratic 
form 

n 

<f> = ds 2 = H ik a lk dxidx k . 
1 

Jn addition to this it is useful to consider a second differential 
quadratic form 0 which differs from the first in that it depends 
on the configuration of V n in S,^ (or in other words is not an 
intrinsic element), or rather completely determines this con- 
figuration. 

To find this function we suppose an infinitesimal segment of 
constant length € measured off along the positive sense (as defined 
in advance) of the normal at every point of the given V n . The 
extremities of all these segments will lie on a hypersurface V' n > 
which is said to be parallel to V u \ there is an obvious one-to-one 
correspondence between points on one and points on the other. 
We wish to consider two infinitely near points of V 1t , and to com- 
pare their distance apart ds with the distance ds r of the two 
corresponding points of V', r 

If the co-ordinates of a generic point of V n are y v (v — 1, 
2, . . . n + 1), those of the corresponding point of V' n will be 

y' v = <J» + * v 

From this, differentiating and remembering that c is a con- 
stant, we get 

Squaring and adding, we get ds' 2 ; denoting it by <f>\ we have 

</>' - n £l(dy*+ e*da?+ 2crfy r <7aJ. 
1 

ji+i 

Now T> v dy* ~- <f>, and since e is infinitesimal it follows that 
i 

e 2 da 2 is negligible compared with the other terms; hence 



f - - - 2€*fs (13) 



252 ABSOLUTE DIFFERENTIAL CALCULUS 

where we have put 

n+l 

0 = -X v dy v da v (14) 

1 

Formula (13) gives the increment of the first fundamental 
form <j> in passing from the given V u to an infinitely near parallel 
surface; this increment is expressed in terms of the quantity 
if/, which, as we shall now see, is a quadratic form in the dx 9 &. 
To show this, we note that 

u 
n 

da y — ?u k a v \ k dx k \ 
hence, substituting in (14), and putting 

T> + 1 

but = hi = — VZ ¥ (y*\t* ¥ \k + y v \k* v \i)> • (is) 

n 

we get 0 = ^bxdXidXk (14') 



This is what is called the second fundamental form. Its coeffi- 
cients b ih , given by (15), can also be expressed in another way, 
which will be useful farther on. Differentiating (12'), we get 

»M 1 

£,( a ri*y,ii + *»y,\») = °> 

or, interchanging the indices I and k, 

^A a v\iy v \k + a vih\ki) = o. 

Taking the half sum of these two identities, and remembering 
the symmetry of the second derivatives, we get 

n+l n+l 

— i^A a v\ky v \i + a v\iy v \k) = ^ v a v y v \ik- 

Changing I into i, the left-hand side of this equality becomes 
the same as the right-hand side of (15), and therefore 

6* =S! adrift (15') 
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4. Forms of class 1 (hypersurfaces in Euclidean space). 

We now wish to find a criterion to determine whether a given 
differential quadratic form 

n 

ds 2 = Y, ik a Lk dx l dx k 
1 

is of class 1, i.e. whether we can find n + 1 functions (10) which 
will reduce it to the Cartesian type. We shall follow a method 
similar to that used in § 1, taking as unknowns the n + 1 functions 
y„ and their n(n + 1) derivatives y v ^, making (n + I) 2 unknowns 
in all. By definition these must reduce the given ds 2 to the 

n I 1 

Euclidean form dy 2 \ this is expressed by the \n(n + 1) con- 
ditions 1 

n + l 

a ik = S r y Hl y Hfr (16) 

From these by co variant differentiation we get the equations 

0 ---- 3*Ay*\uyv\* + y v \iy*\hi)- • • - (i?) 

We have also the condition that the principal unknowns y v 
and the auxiliary unknowns y^t are not independent but are 
connected by the differential relations 

- y, (is) 

We have to determine the conditions of integrability of the 
system composed of (10), (17), (18). 

First, suppose written down the two equations obtained from 
(17) by cyclic interchange of i, k, l\ from these three equations, 
by adding two of them and subtracting the third, we find, as 
in § 1, 

?t + l 

^ v y v \ncy v \i = 0 (i, h 1 = 1, 2, ... n). 

Keeping i and k fixed, we have n linear homogeneous equations 
in the n + 1 unknowns y v \ lk . The matrix of the coefficients 
y v \ h as in the preceding section, has n for its characteristic; 
hence the equations have (n -f- 1) — ■ n = 1 independent solu- 
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tion; the others differ from it by a multiplier. Now we see from 
(12') that we get one solution by taking y^ ik = a v ; hence, intro- 
ducing a multiplier b iJr , we can write the most general solution 
in the form 

To find the significance of these 6's, multiply (19) by a v and 
sum with respect to v from 1 to n + 1, using (11); we get 

n+1 

comparing this with (15'), we find that the 6's just introduced 
(which have the property b 1fc = b kl ) are identical with the co- 
efficients of the second fundamental form. 

We have now to express the fact that the second covariant 
derivatives of the quantities y v > {i satisfy the commutation formula 

n 

Vv\ihk — y v \ikh = — S| {il, hk}y v \ b . . (20) 

which takes the place of the ordinary condition of symmetry of 
the second derivatives. To calculate the left-hand side we must 
start from (19). By covariant differentiation we get 

y v \ikh = a v b ikh + Kk a v \hy • ■ ■ ( 21 ) 

and we have to calculate a lf ^ h . To do this, we note that on dif- 
ferentiating (11) we have 

SX«HA = °> < 22 ) 

and also that the coefficients b Ul can also be expressed in the 
form n+1 

^y v \i**\h = — b £h (i, h = 1, 2, . . . ri), . (23) 

which is at once verified by covariant differentiation of the 
identity n+1 

S r y,,,a, = 0 

combined with the expression (15') for the 6's. If h is fixed, the 
formula (23) represents n linear equations in the n + 1 unknowns 
a v(ft ; combined with (22) they form a system which can deter- 
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mine these unknowns. The determinant of this system is in 
fact 



a n+l 
Vn+l\l 



H\ I u I n 



squaring this, and remembering (11), (12'), and (16), we get the 
determinant [| a lk ||, which is certainly not zero. 

It is easy to verify that the solution of the system (22), (23) is 



(24) 
(25) 



where we have put 

n 

yi = ^kd k y v \k- 

Hence (21) becomes 

n 



The expression for y v \ xhk is obtained from this by interchanging 
the indices h and k. We can therefore write (20) in the form 

n n 

<*>„ (Kk — hi*) — S; (b Lk b Jh — b ih b jk ) y : l S z (il, hk \y y]l . (26) 

In order to express the right-hand side too in terms of yi we 
apply Cramer's usual rule to (25), which gives 

n 

y*-\i = %j a jiyi> 

and substitute this result in the sum; summing with respect to 
I we have 



S £ {il, hk] y Al = Hj (ij, hk) y- 
1 1 

and therefore (26) becomes 

- cl u (b^ - b ihk ) + 2, yi [(6a. b Jh - b* b jk ) + (y , hk) ] - 0 
(v — 1 ? 2, . . , w + 1; i, k, h = 1, 2, . , . w). 



(27) 
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These conditions can be expressed in a considerably simpler 
form. Multiply the equality just written by a v and sum with 
respect to v from 1 to n + 1; remembering (11) and observing 
that from (25) and (12') 



u-fl n w+1 

1 11 



0, 



we get (b ihk — b lkh ) = 0, 

or in other words the coefficients b must satisfy the condition 

Kk = Ku (28) 

The condition (26) can then be written in the form 

hylPij.hk =0 (v 1, 2, . . . n + 1), (29) 



where we have put 

Pihhk = (b ik b jh — b lh b jk ) + (ij, hk). 

Keeping i, h, k fixed, the equations (29) constitute 
n + 1 linear homogeneous equations in the n unknowns 
Pi.hhk 0" = 1. 2, . . . ra). The characteristic of the matrix of 
the coefficients j//, is w; in fact, taking any one of its deter- 
minants of order n, e.g. 



y\ y 



y] yl 



it will easily be seen with the help of (25) that it is equal to the 
product of the two determinants 



y\\n 

y-i\ n 



y»\n 



lo 11 a 12 

I 

1 21 22 

: a a 



a' 



-2» 
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the second of which is certainly not zero. It follows that the 
characteristic of the matrix y[ t is the same as that of the matrix 
y v y, which is n. From a well-known theorem on linear equations 
it follows that the system (29) has no solutions except 

Pij.fuc = 0 {ifj> h 9 k = 1, 2, . . . n), 

which is the same as 

(%j 9 hk) = b lh b jk - . . . (30) 

A more rigorous discussion would show that the formulae 
(28) and (30) express all the conditions of integrability of the 
system. We can therefore conclude that: 

The necessary and sufficient conditions that a given differential 
quadratic form may be of class 1 are that it shall be possible to 
determine a (real) symmetrical double system b ik such tliat 
Riemanns symbols for the given form can be expressed by 
formula (30), and also such that the system b lkh (the covariant 
derivative of b lk with respect to the given differential form) is 
symmetrical (formula (28) ). 

At the end of last chapter (§ 7, p. 236) we found directly, 
by assigning suitable explicit expressions to the functions 
y v (x r , x z , . . . x n ), that every ds 2 of constant positive curvature 
is of class 1. The necessary and sufficient conditions just 
enumerated must of course be satisfied. 

To verify this, we need only take the auxiliary quantities b ik 
in the form \/K a ik , and remember that, as the manifold in 
question by hypothesis is of constant curvature, liiemann's 
symbols (ij , hk) take the form K(a lh a fk — a lk a Jk ). The conditions 
(30) are therefore automatically satisfied. Further, by Ricci's 
lemma, the covariant derivatives of the quantities b ik , i.e. of 
vanish, so that the conditions (28) are also satisfied. 

For K < 0, the hypothesis b ik — %/Ka ik is of no use, as 
it would take us out of the real field, so that we cannot assert 
that the analogous property holds. We can in fact prove that 
for n > 2 a ds 2 of constant negative curvature is not of class L 1 
For n — 2 we know already (§ 21, p. 123) that any ds 2 , and 
therefore in particular a ds 2 of constant curvature, is of class 1 

1 Cf . Rianohi: Lvzioni di ffeomclriti dijferenziale, 2nd edition (Pisa, Spoerri, 
1902), Vol. I, Ch, XIV, § 205, p. 471, 
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(at most), or in other words belongs certainly to some surface of 
ordinary space. There are an infinite number of surfaces of this 
kind (pseudospherical surfaces), with constant negative K 9 in- 
cluding surfaces of revolution of three types. 1 

5. Hyperspherical representation and curvature of a hyper- 
surface. 

Take any hypersurface V n , and consider it as immersed in a 
Euclidean space S n+l ; and consider also a hypersphere of unit 
radius and centre the origin. 2 

We can make each point P of the V n correspond to a point 
P' of the hypersphere by drawing from the centre of the latter 
the parallel to the normal to the V„ at P, and taking the inter- 
section of this parallel with the hypersphere as P'\ V n is then 
said to be represented on the hypersphere. 

The chief interest of this representation is as follows. Let 
V denote the extension (Chapter VI, p. 160) of a region <f> of V u , 
and V the extension of the corresponding hyperspherical region 
V . 

<f>'. Then the ratio — is closely related to the curvature properties 

of V n , and is called the mean curvature of V n in the region <j>. If 
this region reduces to the infinitesimal region round a point 
P— or in other words if the maximum dimension of <f> tends to 

V 

zero— then (if P is not a singular point) the ratio y tends to a 

positive limit F, which is called the hyperspherical (if n = 2, 
the spherical) curvature of the V u at P. 

To find an expression for this quantity, we shall first establish 
a system of intrinsic co-ordinates on the hypersphere. The most 
obvious way of doing this is to assign to each point P' of the 
hypersphere the co-ordinates x x , x 2 , . . . x„ of the correspond- 
ing point P of V n . We shall call the line element of the 
hypersphere da, and shall try to find an expression for it in 
terms of the dz's. 

1 1bid., Ch. VJT, § 103, p, 225; or 3rd edition (Bologna, Zanichelli, 1922), 
Vol. T, Ch. VII, § 127, p. 338. 

2 That is to say, as explained in § 7, Chapter VIII, p. 240, n hypersurface 
O n whose equation in Cartesian co-ordinates is 
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If we denote the direction cosines of the normal to V n at a 
generic point P by a„ (v 1, 2, ... n + 1), as in preceding 
sections, then the direction cosines of the parallel through the 
origin (the centre of the hypersphere) to this normal will also 
be a v . The point P' lies on this line, at unit distance from the 
origin; its Cartesian co-ordinates are therefore a,. 

We then have at once 

da* = S„da„ 2 

1 

n+1 

and putting S,a HA a Hfr = .... (31) 

n 

it follows that da 2 — S /tfr e hk dx h dx k (32) 

i 

This is the first fundamental form relative to the hypersphere; 
it is sometimes called the third fundamental form of the given 
V ir By means of it we can at once calculate the extension V 
of a hyperspherical region 

V = \/ e dx x dx z . . . dx n , 

where e represents the determinant of the quantities e hJe . Analog- 
ously, for the corresponding field of V nJ we have the extension 
Vof<f>: 

V — j ^ s/a dx^ dx 2 . . . dx n . 

If the regions considered are infinitesimal, each integral 
reduces to a single element; taking the ratio of these, we get 

r , u ,r %: W 

where in every case the radicals are of course supposed to have 
their absolute values. 

The coefficients c M can be expressed in terms of the derivatives 
of the y'& by means of (31), which on substituting for a v ^ h the 
values given by (24) becomes 



n n+1 
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and by (9,5) and (16) 

n n+l 

/I. 



] 

M 



From this expression of the e ik *s in terms of the b ik 9 s and the 
a lh ' 9 & it is easy to obtain an expression for the determinant e in 
terms of the determinants a and 6. To find it, put 

Pi (34) 

i 

so that the last of the formula* just given for e hh may be written 
as 

«■** " ft (34') 

1 

Comparing (34) and (34') with the formula) for the general 
term in the product of two determinants, we see that from them 
follow the two equations 

P -= (35) 

e ^ bp, (35') 

where * = \\a lIf \\ and we have put p as can easily be 

a 

verified. Multiplying together (35) and (35') term by term we 
have 

e " - (36) 

a 



Hence (33) becomes 

r~|* 

ia 



(33') 



a formula expressing the hyperspherical curvature T in terms of 
the discriminants of the two fundamental forms. 

It will be seen that the curvature defined here is not an 
intrinsic property, as it depends on the coefficients b lk . 
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Let us apply these remarks to an ordinary surface F 2 im- 
mersed in $ three-dimensional Euclidean space. In this case, 
as we know, there is one distinct Kiemann's symbol (12, 12), 
and (30) gives 

(12, 12) - fc n 6 22 - b x f - 6. 
Hence (33') can be written in the form 

r = i (12 ' 12) 

a 

Comparing this with formula (28) on p. 191, wc sec that 
for n = 2, the curvature F coincides in absolute value with the 
Gaussian curvature K . 



CHAPTER X 
Some Applications of Intkinsic Geometry 

1. General remarks on congruences. Geodesic and normal 
congruences. 

Consider a metric manifold V ni and suppose that at every 
point of V n (or of a region of V n ) there is fixed a direction X, 
defined e.g. by its parameters A'; i.e. that there is given a contra- 
variant system of regular functions A ? (x 3 , x 2% . . . x n ), connected 
only by the usual quadratic identity and otherwise arbitrary. 
On account of this identity one at least of the parameters A' is 
certainly not zero. 

If then we consider the following system of n — 1 differential 
equations 

dx ± dx 2 dx n . 

A 1 " A 2 " ' " Y r ' 

(considering e.g. one of the x's as the independent variable and 
the other n — 1 as unknown functions of the first), we see at 
once that the integrals of this system represent lines of V, t which 
at every point are in the previously fixed direction X; in fact, for 
an infinitesimal displacement along one of these lines, the dx 9 & 
are proportional to the parameters of X. Through every point 
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of the region considered there passes one (and only one) of these 
lines; this follows from the fact that the general integral of 
(1) contains n — - 1 arbitrary constants, which can be so deter- 
mined that for an arbitrarily assigned value of the independent 
variable the other n — 1 variables have values which are also 
arbitrarily assigned. To fix the ideas suppose that in the field 
considered A n is not zero; then (1) can be written 

£ = * (. ........ - 1), 

considering x n as the independent variable. 

It follows from the existence theorem that the integral 
equations 

x t = x t (^J {i -~r. 1,2,... n — 1) 

of the line can be satisfied by an arbitrary set of values of the 
n variables, which is equivalent to saying that the line can be 
made to pass through a point arbitrarily fixed in advance. 1 
Such a system of lines is called a congruence- The quantities 

• dx m 

V = ,-\ where ds denotes the element of arc of the line passing 
as 

through the generic point (x v x 2J . . . x n ) 9 are called the para- 
meters of the congruence, and the elements A/ of the reciprocal 
system are its moments. 

If all the lines of a congruence are geodesies, the congruence 
is said to be geodesic, e.g. congruences of straight lines in ordinary 
space. It is easy to determine the analytical condition which 
expresses this property. We know that the characteristic equa- 
tions of a geodesic can be put in the following form (cf. Chapter 
V, formula (5:*), p. 14 M 

J\i n 

where A* = ~ = x { . 

1 The argument may be made clearer by considering the example of a field 
of force in ordinary physics. In this cane, when a direction X (that of the force) 
is physically defined at every point of the apace considered, then a system of lines 
(the lines of force) is determined which have at every point the direction of the 
force at that point and which, so to apeak, fill all space, as through every point 
there passes one (and only one) line of the system. 
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Now we nave = S z av, 

as i dxi 

substituting in the previous equation and writing everywhere X 1 
instead of x h we get 

P l = + x l - o, 

from which, by (5'), p. 147, we get 

p' =,= (A') { A l =0 (» = 1, 2, . . . n). . (2) 

1 

These are the required conditions. We can express them partly 
in terms of the moments by multiplying by a ik and summing with 
respect to i, which gives 

i 

and as by Ricci's lemma 

a*(A*)/ = 

we get finally 

^=S,A w A<=0 (* = 1,2, ...n). . (2') 
i 

Another important special property which a congruence may 
have is that of being normal, i.e. that of being composed of the 
orthogonal trajectories of a family of surfaces. It should be noted 
here that, given a family of surfaces, there always exists a congru- 
ence of curves which cut all the surfaces of the family at right 
angles and are called orthogonal trajectories; while there does not 
always exist a family of surfaces which cut at right angles all the 
curves of a congruence. This can be shown as follows. 1 

First, let there be given a generic family of surfaces whose 
equation is 

f(x) — constant. 

l lt may be noted incidentally that in chapter V f p. 127, we have already recog- 
nized the existence of the directions normal to the t ami lies of co-ordinate surfaces 
Xi ess constant, and determined their moments. These results could have been 
used here, as any family of surfaces / = constant can always be turned into co- 
ordinate surfaces by a change of variables. The line of argument followed in the 
text has the advantage of giving" directly the explicit expression for the moments 
of the normal directions when the equation of the family of surfaces is general. 
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Consider the surface which passes through a specified point 
P whose co-ordinates are a^, x 2 , . . , x u ; it is understood that P 

is regular, i.e. that the first derivatives are finite and con- 

ox b 

tinuous at P and are not all zero. We wish to show that a direc- 
tion perpendicular to the surface, i.e. to every displacement Sa^ 
belonging to the surface, is uniquely associated with P. 

We first note that for every such displacement 8x { we have 

fix + as) = f{x) 

or S f V. 8x^=0 (3) 

i dXi 

If we denote by A, the moments of the hypothetical perpendi- 
cular direction, then the condition of perpendicularity to every 
displacement in the surface is expressed by the relation 

^XMi - 0, (4) 

i 

which must hold for all values of the Sx t 's which satisfy (3). The 
coefficients in (3) and (A) of 'uxch 8x t must therefore be propor- 
tional (cf. § 3, p. 250). In virtue of the quadratic identity 

1 

the moments cannot all be zero, so that we can suppose that one 

1 df 

of them, say A„, is not zero, and put - = p. Writing f t 

£) f ii it 

instead of ■ for shortness, the explicit relations equivalent to 
(4) take the form 

/ = pK (i — 1, 2, . . . n). . . (5) 

The //s being known, these equations determine the A/s, 
except for a factor, which in turn is determined (except in 
sign) by the above-mentioned quadratic identity, which gives 

n 

^»a- Qikfifk — P 2 * The left-hand side cannot vanish, as by hy- 
pothesis one at least of the //s is not zero; we are therefore sure 
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that p 4= 0. Thus given the family of surfaces / ~ constant, 
the orthogonal direction at each point is uniquely determined; 
the positive sense on this direction can be chosen at will (cor- 
responding to the double sign of />). The A/s being known as 
functions of position, the reciprocal elements A' can be obtained 
from them, and thence, by (1), we get a congruence of lines 
which cut orthogonally the surfaces of the given family. 

Vice versa, given a priori a congruence of lines by means of 
their moments A, (to be considered as given functions of position), 
then in order that the lines of the congruence may be considered 
as orthogonal trajectories of a family of surfaces / = constant 
the necessary condition is that the derivatives of the function 
f(r l9 x 2 , . . . x tt ) (which is a priori unknown) should satisfy (5), in 
which p denotes a factor which is not zero, but is a priori un- 
determined. Such an / does not always exist; we have indeed 
already seen that the necessary and sufficient conditions for its 
existence are (Chapter II, p. 29, formula (23) ) 

(i, j, h = 1,2,... ft), 

where we must now take X { ~ A„ X } = A,, X 1c = X lr Only 
some of these conditions are distinct, e.g. those in which the index 
k has the fixed value n (the conditions (20) of p. 27), the others 
being deducible from them. 

2. Sets of ft congruences. Determination of a vector by n 
invariants. 

We shall now consider n congruences of lines in a generic 
V n ; thus n directions X x , X 2 , . . . X n will be fixed at each point. 
We shall further suppose that every two of these directions are 
orthogonal, and we shall then say that we have fixed in V n 
a set of n orthogonal congruences. 

The parameters and moments of these congruences will of 
course have two indices, the first of which represents the ordinal 
number of the congruence. We shall use the term the congruence 
(h) to denote the congruence whose parameters are AJ, A^, . . . A}J, 
and whose moments are therefore the reciprocal elements A AJ1 , 
^A|2> * • • ^h\n (with respect to the d$* of the manifold). 
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In addition to the usual quadratic identities we shall here 
have the conditions of orthogonality of the congruences. Both 
sets are included in the formula 

|iA A|i Aj = 8* (h, k = 1,2, ...n); . . (7) 

iih— k y this is the usual relation between parameters 
and moments, and if h 4= k it expresses the fact that the 
directions *k h and X A , i.e. the congruences (h) and (&), are 
orthogonal. 

The equations (7) also express the essential fact that the n 2 
parameters X l k of a set of n orthogonal congruences are the recip- 
rocal elements (in the algebraic sense) of the n 2 moments X,,^ 
of the same set of congruences, and vice versa (cf. Chapter IV, 
p. 74; Chapter VII, p. 206). In addition to (7) the equivalent 
formute 

% h K\iK = 8- = 1, 2, . . . n) . (7') 

therefore hold. Multiplying these by a Jk and summing with respect 
to j we get the important formula 

71 

= %hh\ih\k (i>k = 1, 2, ... n) . (7") 

giving the coefficients of ds 2 in terms of the moments of any set 
of n orthogonal congruences. Analogously, multiplying (7') by 
a*, summing with respect to i, and then putting i instead of j t 
we get 

a* = £„ K XI (i, lc = 1, 2, . . . n). . (7"') 

1 

A vector R of our V n is determined, as we know, by its co- 
variant components R, or its contravariant components R i . 
Hence when a set of n congruences is fixed in V n , the vector can 
also be determined by its n projections on the directions belonging 
to these congruences at the point where the vector is considered. 
By definition (Chapter V, p. 126), the projection of R on the 
direction \ h is the invariant 
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which can be expressed in either of the two equivalent forms 

c h = £< (8) 



c A = Zi^K (9) 



1 



Thus the vector R is determined by the n invariants c u . If 
we wish to deduce from these, in a given system of reference, the 
covariant or contravariant components, we need only solve the 
equations (8) or (9), which, together with (7'), give 

= i h c h X{ (8') 

1 

n 

R it = H h c h X h ^ (9') 

i 

If in particular the vector R is the gradient of an invariant 
/ (i.e. if the components R t are the derivatives f. of / with respect 
to the variables x { ), then the invariant c h represents the intrinsic 
derivative of / in the direction of the congruence (h). In fact, 
if s h denotes the length of the arc of one of the lines of the con- 
gruence (A), measured from an arbitrary origin, then for a dis- 
placement ds h along this direction the increment of/ will be 

a 

df = ^f t dx i9 
i 

where the dx 3 & are the differentials corresponding to this dis- 
placement. 

Dividing this quantity by ds h we get by definition the deri- 
vative ^- off in the direction of the congruence (It); remembering 
ds h 

dx ' 

that = A}„ we therefore have 

% = S^AL (10) 



a formula corresponding to (9). Solving it, we get the formula 

/,=■ S^A*,, (10') 
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corresponding to (9'); and lastly, changing to the reciprocal 
elements, we get also 

f = i h df K (io") 

which corresponds to (8'). 

In general, it would be easy to show that when a set of n 
congruences is fixed a tensor of rank m can be determined by 
n"' invariants, instead of by that number of components, co variant, 
contravariant, or mixed, the proof being completely analogous 
to that given above for determining a vector by means of n 
invariants. This result simplifies the study of certain questions, 
so that we shall find it useful to carry somewhat further our 
investigations on sets of n congruences. 

3. Geometrical definition of Ricci's coefficients of rotation. 

We must now introduce a system of differential invariants 
which are closely connected with the set of n congruences. We 
shall reach the required result quickly by the following 
method. 

Consider two very near points P and P' of V n . At each of 
them the lines of the n congruences determine a pyramid (a genera- 
lization of the notion of the trihedron) whose directions are 
mutually orthogonal. Jf X x , . . . X H are the n directions at P, 
those at P' will be Xj' — X t + S'X^ . . . X' t ~ X, t + S'X„, and 
we shall say that we pass from the first to the second by local 
displacement, i.e. by the law previously fixed which regulates the 
behaviour ot the noo (,l_1) lines of the set of n congruences. But 
the pyramid of directions can also be moved from P to P' by 
parallel displacement;, we shall then get at P f n mutually ortho- 
gonal directions X[ — X x S X l9 ... X* ~- X„ + S*X„, which 
will not in general coincide with those obtained by local dis- 
placement. We shall thus have at P' two pyramids infinitely 
near one another, since each is infinitely near the pyramid* 
X x , X 2 , . - . X jr This means in particular that the ith direction of 
one makes an infinitesimal angle with the ith direction of the 

other, and an angle very nearly equal to - with the remaining 

n — 1 directions of the other. We propose to examine these 
infinitesimal differences. 
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Consider two directions X A , X* of the pyramid at P; these 
either coincide (h = k) or are orthogonal, so that we have 

A 

cosX A X fr = S£. 

Let them be displaced to P', the first by local and the second 
by parallel displacement, so that the first will coincide with 
X/ t and the second with X/ t . We shall calculate the resulting change 
in the cosine of the angle between them, i.e. the quantity 

AAA 

ScosX A X /r — cosX^X,* — cosXaXj. 

This is an infinitesimal of the same order as the distance ds 
between P and P r , and we shall therefore write it in the form 
p hfc ds\ thus p Uk will give us a kind of measure of the rate at which 
the cosine in question changes for a displacement in the direction 
PP'. To calculate it, we start from the formula 

A n 

cosX A X* = £ 4 A, ljt -A}, 

and differentiate it, remembering that we have to operate on 
A /l( ; with the symbol 8' (local displacement) and on X\ with the 
symbol 8 s (parallel displacement). We shall get 

a n 

p hk ds = 8 cosX /t X A S t (8' A A , 4 • Ai + A /f , f - 8* A! ). . (11) 

We have also from the ordinary rule of the differential cal- 
culus 

where 8x^ denotes the increment of the co-ordinate Xj in passing 
from P to P', and from the law of parallelism 

1 

Substituting in (11) we get 

p M ds - E^AT^'S*, - X tJl \ hli {jl, i}K** r 

I OX. 1 
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In the second sum interchange the indices i and Z, so as to 
get the same factor A£ Sse, as in the first. We can then write 

p hi ds = ^^-^{jiJ}X hll jK8xj, 
or, remembering formula (5) of p. 147, 

71 

p hk ds = X tJ \ hW \l8Xj. . . . (11') 
Denoting the parameters of the direction PP' by 

g ' ds' 

we have the formula 

P*t = Z.fKwK?. • • ■ • (ii") 

which holds for any direction whatever. 

It is to be noted that p Mf9 as given by the original definition 
(11), changes sign when the two indices are interchanged. This 
can be proved without difficulty, either from the final expression 
(11"), by going back to (7) and taking its covariant derivative; 

A 

or more geometrically, by using the property that any coaX^X* 
is unchanged by either local or parallel displacement, so that the 
formulae 




both hold. 

Carrying out both differentiations and using the results to 
transform (11), we get 

p hk ds = - S t (A A|i -8'A£ + A£-S*A A|t ). 
Further since cosX^X^ can also be expressed in the form 

n 

L t A^A fr |i, (11) is equivalent to 
i 

p hk ds = S, : (8'A£-A fr|< + \l-8*\ kli ). 
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Interchanging k and k, and adding to the previous equation, 
we get the required identity 

Phk + Pkh = 0 (A, ft = 1, 2, . . . n). . (12) 

We shall now examine the case when the direction of dis- 
placement coincides with one of the directions belonging to the 
set of congruences, say the Zth. We shall then have 

e = k 

and, denoting by y hH the value of p h/c in this particular case 

A 

(i.e. the rate of variation of cosX /t X /; for a displacement in the 
direction of X i5 in which X 7t is moved by local, and X A . by parallel, 
displacement), we shall have from (11') 

V 

Yhu r -= ^o X h\oKK (*• *. * = 1, 2 f . . . n). . (13) 
1 

The quantities y were introduced by Ricci, and named by 
him the coefficients of rotation of the set of congruences. They 
have various important properties. 

In the first place, they are invariant, as follows from (13) by 
the law of contraction. Wo have farther, as a particular case of 
(12), 

Yhh + Yku = 0 (A, hl= 1, 2, ... n), . (14) 

which for ft — h reduces to 

Yhm = 0 (15) 

We can also give a direct formal proof of (14), on the lines 
already suggested for the more general case of the p's. Starting 
from the identity (7), and taking the covariant derivative, we 
shall get (remembering Chapter VI, p. 152) 

n v 

AJ. + S £ A k i tj X % h — 0. 
I 1 

Multiplying by A^, and summing with respect to j from 1 
to n, we get n . 

A A j y X k A J + A A .j^- \ h X l = 0 

or Yhki + Ym = °- 
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The number of these invariants y, which depend on three 
indices, is a priori w 3 ; but they are connected by the \n*{n + 1) 
relations (14) of antisymmetry. Hence the number which are 
algebraically distinct is at most 

w 3 _ n>(n +J) ^ n*(n - 1) 

" 2" 2 

The minuend n 3 is equal to the number of the derivatives 
X h \ kl of the quantities X h \ k , and the subtrahend £n a (w + 1) to 
the number of the relations given above as resulting from the 
differentiation of the equations (7) and connecting the w 3 deriva- 
tives. We can accordingly express the derivatives X h ^ d as func- 
tions of the quantities \ h \, £ and y, by solving the equations (13). 
To do this, multiply (13) by X k ^f X t ^ 9 and sum with respect to 
k and L We get 

n n n . n 

^fd Yttki K\i' A z |/ = S,y A,, | S /f X k X k j if A x \ |y 
= A A j 4 y; 

or finally, replacing i' and j' by i and j, 

K\o ^ ^wy^w A/ f |i A;j^ (16) 

This result shows that in order to study the differential pro- 
perties (i.e. the properties depending on the way in which the 
A's vary) of the lines of the given congruences we need only 
consider the invariants y, in terms of which all the derivatives 
of the A's can be expressed. 

The geometrical significance of the y's, which we have already 
illustrated, is particularly expressive in the case of ordinary space. 
In this case the three congruences define at every point a triplet 
of orthogonal directions, and p 23 , p 31 , p 12 are the components of 
a vector o> such that a>ds is the elementary rotation of the triplet 
in the local displacement from P to P'. 1 

J See e.g. Lkvi-Civita and Amaldi: Lezioni dl Meccanica Rationale^ Vol. I, 
p. 178; Bologna, Zanichelli, 1923. For the general case, the reader may be referred 
further to a paper by Signorina CaRPANKSK: " Paralloliwno o eurvatura in una 
variety qualunque", in Anncdi di Mat., Vol. XXVIII, 1919, pp. 147-169. 
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4. Commutation formula for the second derivatives along the 
arcs. 

The invariants y occur in another important formula, which 
we shall now establish. 

We wish to compare the two second derivatives 

3 3/ , 3 3/ 

— — ^- and — — ; 

.ds k ds h ds h ds,, 

we shall find that they are not equal, but are connected by 
a more complicated relation involving also the first derivatives 
and the y\s. 

We have in the first place from (10), differentiating the 

invariant ~- with respect to x y and applying to the right-hand 

3«a 

side the rule for differentiation given in Chapter VI, p. 152, 

3 3/ " i n i 

OXj OS h 1 ] 

We next replace/ 1 in the first term on the right by the expres- 
sion given for it by (10") (putting I instead of h for the index of 
summation), multiply both sides by A^, and sum with respect 
to j. We thus get 

? A * d% (10 = b{ A < A * x " l<j + f ijf,J A * A »" 

By the definition (10) of the intrinsic derivative, the left-hand 

3 3^ 

side of this equation is precisely ~— the first term on the 

right, from the definition (13) of the invariants y, reduces to 

v 3/ v 3/ 

1 os l 1 os t 

We therefore have 

(I)«55) 10 
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with respect to i and k from 1 to n; remembering (13), we get 
Ky (y,*'t' — Ym'v) = 0. ... (19') 

As j may have any value, we can always choose it so that 
4= 0; thus we have 

YnH = Ymk {% k = 1, 2, . . . n — 1), . (20) 

where we have written i, k instead of i\ k\ Reciprocally, if the 
equations (20) are satisfied, the equations (19') follow from them, 
and therefore also (1 9) as a necessary consequence. The equations 
(20) therefore constitute the required condition. 

It is not without interest to find this condition by another 
method, starting from the remark that if the quantities X u ^ £ are 

to be proportional to the derivatives of a single function/, 

ox h 

these derivatives can be substituted for them in the conditions 
of orthogonality 

XiKnK = 0 <A= 1,2, ...rc-1), 
i 

so that the hypothetical function / must satisfy the linear system 
of partial differential equations 

k*kt f - - 0 (h= 1,2, ...n-1). 
i dx h 

Reciprocally, if there exists a function / which satisfies these 
n — 1 equations, its derivatives must be proportional to the 
quantities A /j(; . 

Hence the conditions in question are the necessary and suffi- 
cient conditions that the given n — 1 equations may constitute 
a complete system (cf. Chapter III, § 9, p. 52). 

To make the notation agree with that in Chapter III, we intro- 
duce the linear operators 

X h = S;A1~ (A= 1,2, ...n-1), 
i con- 
noting that (10) shows that these operators are identical with the 
3 

derivatives with respe,ct to the arcs. We thus have the system 
ds h 

X h f = 0 (h = 1, 2, . . . n _ 1), 
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and we have to express the condition that, for h, k = 1, 2, . . . 
n — 1, Poisson's parentheses 

(X h , X k )f X h X k f — X k X h f 

are linear combinations of the terms Xff. 

Now, repeating the steps of the calculation in § 4, or better, 

borrowing from it the value already found for - - we have 

ds h ds k , 

X h X k f £J/ = - ^Y^ + ^MK 

Interchanging h and k, and subtracting, the second sum dis- 
appears. In the first, we must separate out the term correspond- 
ing to the value n of the index Z, and put X t f again instead of 

%. We thus get 

(X h , X k )f ■= S, (y, hk — y lkh ) XJ + (y nhk — y nkh ) . 

1 ds n 

This must reduce to a linear combination of the quantities 
X t f (I = 1,2, ...rc- ]). 

As 5^ is independent of the X,% its coefficient in each of 

the parentheses included in the above expression (i.e. for 
ft, k — 1, 2, . . . n — 1) must vanish; this brings us back to (20). 

It may be noted that if all the n congruences of the set are 
normal, the ys with three distinct indices are all zero. In fact, 
choosing three distinct indices i, h, k, we have the following 
identities: 

ViMc = Vikh* 

Yhki = yhik) 

YkVi — Yhhii 

adding the first two and subtracting the third, and remembering 
that the y's are antisymmetrical in the first two indices, we get 

Yihlc ™ ~~ Yihlei 

^ Yihk 0 (i, K k = 1, 2, . . . n) 
for every triplet of three distinct indices. 
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If we put 

Vij, hk = ^"ViJh ~ ~^V& + ^ { Yifi (Vtok — Yikh) + Yiik Ytjh — Yiih Yijk } > 

we get at once, by definition and the antisymmetry of the co- 
efficients of rotation y with respect to the first two indices, 

Yihhk ~ — YiJ,khi Yihhk — Yji t hk' 

I add, but without giving a proof, that the cyclic identities 

Yi Jr hk + Yih.kj + Yik.jh — 0 

also hold; and from these it follows ultimately that 

Yij.hir = Yhkji (^i, K k = 1, 2, . . . n). 

Ricci discovered all these results as far back as 1895, basing 
his researches with regard to the four-index y's on the analogous 
properties of Rieinann's symbols of the first kind (cf. Chapter 
VII, p. 179). A particularly simple and direct proof has recently 
been given by Dei. 1 

8. Canonical system with respect to a given congruence. 

In many questions a congruence of lines is either among the 
data of the problem, or is closely connected with them. In order 
to deal with these problems it is often useful to associate with 
the given congruence n — 1 others, forming with the given one 
a set of n mutually orthogonal congruences, so that the given 
congruence can be considered as the nth of this set. The choice 
of the n — 1 auxiliary congruences is a priori arbitrary; in many 
cases this arbitrariness may be taken advantage of to introduce 
some simplification. This is possible, as we shall now see; and 
the conclusion we shall reach is that given any congruence what- 
ever, there is always at least one way of choosing the other n — 1 
so that the relations 

Ynki + Ynik =0 (k 4= I; k 9 I = 1, 2, . . . n — 1) (21) 

may be satisfied. 

The system (or any one of the systems) of n — 1 congruences 

3 * c Sulle relazioni differeuziali che lugano i coefficient di rotazione del Ricci 
in Rend, delta B. Acc. dci Limci. VoL,XXXII (first half-year, 1923), pp. 474-479. 
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which possesses this characteristic is called a canonical system 
with respect to the given congruence. 

To prove that such a system exists, we associate with the 
given congruence a system — for the moment any whatever — 
of n — 1 other orthogonal congruences, and fix our attention on 
a generic point P of the manifold; for shortness we shall denote 
by m the pyramid of the n — 1 directions X l9 Xg» • . . X nvl drawn 
from P, orthogonal to X u and to one another. Suppose this 
pyramid rotated round the direction X,„ by which we mean that 
we pass from the pyramid m to another w formed by n — 1 
other directions X^, X^, . . . Xl_ 3 , also drawn from P, and ortho- 
gonal to X„ and to one another. We wish, if possible, to 
determine the rotation so that after it has been effected the 
relations (21 ) may hold. For this we shall start from the relations 
connecting the X^'s with the X>/s, which express analytically 
the rotation described. 

Let a hk (h 9 k = 1, 2, ... n) be the cosine of the angle be- 
tween the directions \ h and \ k . Naturally, if only one of the two 
indices h, k coincides with n, the corresponding a is zero 

^X„ X„, and the corresponding angle is while a nn = 1. 
The formulae for this are 

<*/m = <*ha = 0 (h y k — 1, 2, . . . n — 1); 

We have in any case by definition 

n 

£jA£**u = a M (*» * = *> 2 ■ • • n )» 
and thence, multiplying by A /t | # - and summing with respect to h, 

n 
1 

Limiting k to the values 1, 2, . . . n — 1, for all of which 
a nk = 0, we can take the sum on the right only to n — 1, so 
that we have 

K\i = S A a /iA X h[i (k — I, 2, ... n — 1); . (22) 
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i.e. the moments of xa' are connected with those of w by a linear 
substitution, as could have been anticipated, and the a^/s are 
the coefficients of this substitution. It is also to be anticipated 
that the substitution is orthogonal. To prove this, we take the 
equations (7"); putting k = i 9 they give 

= *a (i = 1, 2, ...n). 

The coefficients a u on the right depend on the co-ordinates of 
reference, but not on the choice of the congruences associated 
with (w). 

Since \ k \ n = 0 for h #= n, it follows that for any value of 
i (=^= n) the expression 

is invariant for rotations of the pyramid w, and therefore the 
substitution defined by the a's is orthogonal. We have now to 
arrange this orthogonal substitution of order n — 1 in such a 
way that the relations (21) may be satisfied. 

To do this we start from (16), from which we get as a parti- 
cular case 

n 

K\ij = ^ki7MK\i\\j (ij = !> 2, . . . n — 1). 

The terms of this sum in which k — n vanish, by (15); those 
in which I = n can be separated out by writing 

71-1 

\i | ij = ^kl Ynkl K + \i \j ^k Ynkn K | 1 • 

The last sum can be suitably transformed by replacing 
by the expression given by (J 3); we then get successively 

n n 

^kYnkn K\i — ^kpqK.\pqKK^k\i 
1 1 

n 

= ^pq \i\pqtfi&i 
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Now in this formula it is to be remarked that the left-hand 
side and the last term on the right depend on the parameters 
and moments of the direction X n alone, and do not depend on the 
other n — 1 associated directions; the same must therefore be 
true of the remaining part, i.e. of the sum 

n-l 

Z kl Y,ikiK\i\\j (hj = 1, 2, . . . n — 1). . (23) 

We can therefore conclude that these expressions are invariant 
for any rotation whatever of the pyramid to. 

Of the (n — l) 2 quadratic forms included in formula (23), 
which are obtained by choosing the indices i, j in every possible 
way, we are interested in any one in which i = j. Fixing the 
index i once for all, and putting for shortness 

K\i = *r (r ---=■ 1, 2, ... n — 1), 
the corresponding quadratic form is 

n-l n-l 

YnH Z/c *i ^ i S w (y nH + Ynlk) *i- • (23') 
1 1 

In this the coefficient of the product z k z t is y M + y H ik> *- e - 
the left-hand sides of (21). If we wish to satisfy (21), we must 
make all the coefficients of the terms in z k z h for w T hich k 4= I, 
vanish by means of the orthogonal substitution (22), which we 
shall write in the form 

z h — *L A a /tk z h ; .... (22') 
i 

this is equivalent to reducing the invariant quadratic form (23') 
to the canonical form 

%PkZ* 2 (23") 

1 

by an orthogonal substitution. This algebraic problem is always 

soluble. In the cases n — 1 = 2 or 3, it corresponds to the pro- 
CD 655) 10* 
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blem of finding the axes of a conic or a quadric, and is discussed 
in ordinary analytical geometry. In the general case the theory 
leads to the following result. 
Consider the equation 

\\UVnM + Ynu)-%p\\ = 0, . . . (24) 

which is of degree n — 1 in the unknown p> and is called a 
secular equation. Its n — 1 roots are always real (it is understood 
that we suppose the quantities y M real), and give the ft — 1 
coefficients p k of the canonical form (23"). 1 

We can therefore always choose, at any point P, the pyramid 
w and therefore the system of the n — 1 congruences (I), (2), . \ . 
(n — 1) so as to satisfy (21); i.e. there always exists at least one 
canonical system with respect to a given congruence. If the 
n — 1 roots of (24) are all different, the canonical system is 
uniquely determined; if they are all equal, any system of n — - 1 
congruences which are orthogonal to one another and to (ft) 
satisfies (21) and may therefore be called canonical. In the 
general case where the number of different roots is p (1 < p < 
ft — 1), then ft — 1 — p coefficients of the orthogonal solution 
are arbitrary, and there are therefore oo"~ 1 ~ 3P canonical systems. 

9. Congruences of straight lines in Euclidean space. Geo- 
metrical significance of the canonical system. 

In ordinary (i.e. Euclidean three-dimensional) space parti- 
cular importance attaches to congruences of straight lines, which 
present themselves for consideration in various questions of geo- 
metrical optics; since the rays of a light pencil (in a homogeneous 
medium) form a rectilinear congruence. 

We shall now discuss a geometrical property of these con- 
gruences, which will be seen to be connected with the discussion 
in the preceding section; or rather — since it involves no greater 
complication— we shall discuss congruences of lines in a Euclidean 
space of any number n of dimensions. 

Consider a generic point P, and let r be the ray through P 
of the given rectilinear congruence; let X be the hyperplane 
(in ordinary space the plane) perpendicular to r at P. Take a 
displacement in X represented by the infinitesimal segment 

1 Compare Chapter VII, p. 205, where references are given. 
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PP' — € in any direction; through P' will pass another ray 
r' of the congruence. In general, the two rays r, r' are skew; 
if for a particular direction of the displacement PP' it happens 
that they both lie in the same plane, i.e. that they meet or are 
parallel (more precisely, that the minimum distance between 
them is an infinitesimal of higher order than e), this is called a 
focal direction. We shall now show that in general there exist 
n — 1 focal directions, all or some of which may be imaginary, 
coincident, or indeterminate; we shall then point out an impor- 
tant particular case in which these directions coincide with those 
of the canonical system. 

Let PP' then be a focal direction; there will be a point C 
(which may be at an infinite distance) common to r and r\ 

Denote the length CP by — (so that we shall have the particular 

co 

case of the rays being parallel at the limit when co — 0), and 
let us take as axes of reference n orthogonal Cartesian axes 
y v (y = 1, 2, . . . n). Let A,^ be the cosines of the direction n (i.e. 
its parameters or moments, since in Euclidean space A„j >f = A*). 
The projection on the axis y v of the segment CP will then be 

given by - A„ k , and that of CP' will be 

CO 

CO \co / 

while the projection of PP' is dy v . If then we express this last 
term as the difference of the other two (PP' being the third side 
of the triangle CPP'), we have 

We now wish to use the methods of the absolute calculus. 
We shall therefore associate with the given congruence n — 1 
other congruences, orthogonal to it and to each other, which we 
shall distinguish by the indices 1, 2, . . . n — 1. In addition to 
the projections of PP' on the axes we require also its projections 
on the set of n congruences so defined; for this we must multiply 
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the last equation by X v h (h =^ 1, 2, ... n) and sum with respect 
to v. First, let h = n\ the projection of PP' on the direction 
n is zero, as PP' by hypothesis belongs to the hyperplane X; 
hence the left-hand side is zero. Further, in consequence of the 
identity 

n 

E„ AjiAfli,, = 1, 
i 

it follows that 2„ K^K\ V = 0; 

i 

hence finally we get <Z * ^= 0, 

which expresses the a priori evident fact that CP = OP' 
(of course neglecting infinitesimals of higher order than the first). 
Putting h in turn equal to 1, 2, ... n — 1, and denoting by e h 
the projection of PP' on the direction h, we find 

1 n 

€ h - ^X;idX n{v {h = 1, 2, . . . n — 1). 

We shall now expand ^A,,^, remembering that since Chris- 
toffePs symbols are all zero, we can replace the ordinary by the 
covariant derivatives, and also that since t 3 , € 2 , . . . € n _ l are the 
projections of PP' on the directions of the set of congruences, 
and dy k (k --= 1,2,... n) its projections on the axes, we there- 
fore have 

dy k — EjCjXf. 
The last formula thus becomes 

j n n-l 

€ h = - S yAr A;, X n j vlt 5Jj € Xj , 

CO 1 1 

n-l n fr 

or we h — 2 e 2^. X n , vk X v h X f . 
ii 

Remembering the definition of the ys we have the system 
of n — 1 equations 

u>e h ZjtjYnhj (A = 1, 2, ... n - 1), . (25) 



CANONICAL AND FOCAL DIRECTIONS 285 



which we can also write 

S- (y^j - 8{a>)^ = 0 (h = 1, 2, ... n - 1). (25') 
1 

This linear homogeneous system must determine the focal 
directions PP f (if they exist) in the hyperplane X, by giving 
their projections e l9 e 2 , . . . e n _ 1 on the orthogonal directions 
1, 2, ... n — 1 which we have associated with the ray r. 

The necessary and sufficient condition that the system (25') 
may have solutions € which are not all zero, is that the deter- 
minant of the coefficients should vanish, i.e. that to should satisfy 
the equation of degree n — 1 

II y,^- 81 a> || = 0 (h,j - l,2,...n- 1). (26) 

To every root to corresponds at least one set of values of the 
e's, i.e. at least one focal direction PP f . Hence in general there 
are n — 1 of these directions, which, however, like the correspond- 
ing roots of (20), may be real or imaginary, distinct or coincident, 
or (in the case of multiple roots) may be capable of having an 
infinite number of determinations. 

Tn fact, the properties of the secular equation, as noted in the 
preceding section, hold for symmetrical determinants of the type 
(24), while the left-hand side of (20) is not in general of this 
form. There is, however, an important category of congruences 
with this characteristic, which we shall now consider. 

Normal congruences of rays. —If our congruence (w) is normal, 
then by (20) 

Ynh} = 7n)h 1, 2, ... n - 1). 

We can therefore substitute \(y n hj~\~Ynjh) f° T Ynh/9 so that (26) 
at once becomes identical with (24), which defines the canonical 
directions. It follows that the canonical and focal directions 
coincide. Hence on the one hand we have the geometrical inter- 
pretation of the canonical directions; and on the other, from 
the properties noted at the end of the preceding section, we have 
the property that the focal directions are always real, and are 
in general determinate and mutually orthogonal; and further, 
that in the case of indeterminateness, when there is an infinite 
number of them, it is always possible (and in an infinite number 
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of ways) to choosd n — 1 of them which shall be mutually ortho- 
gonal. 

As we are dealing with a normal congruence, there exists 
(by definition) a family of surfaces 

f(x v x 29 . . . x n ) = constant, 

which are cut orthogonally by the straight lines of the con- 
gruence; these lines therefore constitute the common normals 
to all the surfaces of the family. If we fix one of these surfaces, 
and associate with every point on it the n — 1 focal directions, 
we shall get n — 1 mutually orthogonal congruences of lines on 
the surface. These lines are called lines of curvature, by an obvious 
generalization from the lines so determined in the case of sur- 
faces in ordinary space (n = 3). Tn fact, given such a surface, 
say a, the normals to it form a normal congruence (since they cut 
a and the surfaces parallel to a orthogonally); and if we consider 
the two focal directions at every point of a we arrive at precisely 
the ordinary definition of the lines of curvature as those lines of 
a along which the normals to a generate a developable ruled 
surface. 

General Case.— If the congruence (n) under consideration is 
not normal, then in general, as we have seen, the focal and 
canonical directions at a generic point P of a ray r do not coincide. 
In order to find an interpretation of the canonical directions in 
this case, we should therefore have to examine in greater detail 
the behaviour of the rays of the congruence which are infinitely 
near r. 

For n = 3 there is a classical discussion by Kummer, 1 giving 
a very illuminating interpretation of the canonical directions, 2 
and pointing out in particular that the directions which bisect 
the angles between the canonical directions also bisect the angles 
between the focal directions (when the latter are real). 

We shall leave the question at this point, merely pointing 
out to the reader the possibility of analogous interpretations for 
n>3. 

3 See e.g. Bianchi: Lczioni di Gcometria Diffcrenziale, Vol. I (third edition; 
Bologna, Zantohelli, 1922), Ch. X. 

2 Cf. T. Levi-Oivlta: "Sulle congruenze di curve", in Rend, della JL Ace. dei 
Lined, Vol VIII (first half-year, 1899), pp. 239-46. 
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Physical Applications 



CHAPTER XI 



Evolution of Mechanics and Geometrical Optics; 
Their Relation to a Four-dimensional World 



1. Hamilton's principle for a free particle. 

We start from the equations of motion of a material particle 
in a conservative field. Let U be the potential for unit mass. 
The equations of motion, in Cartesian co-ordinates (referred 
to fixed axes) y v y 2f y 3> are 



where as usual dots represent differentiation with respect to the 
time t. If we denote the square of the line element described by 
the moving particle in tlie small interval of time dt by 



and if v is the velocity of the particle (in absolute value), then 



according to Einstein 




(t - 1, 2, 3), 



(1) 



dl* - S^yJ 8 




Putting 



L = 



bv 2 + u 
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it is known that the equations (1) can be summed up in the 
equation of variation r 



which expresses Hamilton's principle. 

Let us fix our attention for a moment on (2). It implies an 
interval of integration (£ 0 , fixed arbitrarily in advance; and 
the vanishing of the left-hand side of (2) for variations Sy^ of 
the y's, zero at the extremities but otherwise arbitrary, is equiva- 
lent to the equations (1) being satisfied in the same interval. 

This case, in which t does not vary (i.e. S£ — 0), is the 
simplest application of Hamilton's principle. Various generaliza- 
tions, however, in which t also varies, either freely or subject to 
certain conditions, have become classical. We shall shortly have 
occasion to discuss one of these generalizations which concern 
the equivalence between the equations (1) and (2). Meanwhile 
we may note that if the co-ordinates are changed in any 
way, so that the Cartesians y x , y 2 , y. A are replaced by any 
set of three curvilinear co-ordinates, or more generally by three 
Lagrangian parameters x l9 x. A , connected with y v y 2 , y 3 by 
relations which may involve the time and which are regular and 
reversible in the field considered, namely, 

x h = VhiVi, 0 (A = 1,2, 3), 

or, solving with respect to y t (i = 1, 2, 3), 

(T'zY y t ? -= vA*i> t) (i = 1, 2, 3); 

then if we insert these expressions in L, it becomes a function 
L(x | x | t) of the arguments x h9 x u (h — - 1, 2, 3), t, quadratic 
(in general not homogeneous) in the xs. 

As we propose to consider L as an invariant, it follows that 
(2) will hold for the Lagrangian parameters x, and we have only 
to find its explicit form. Calculating the variation and integrating 
by parts in the usual way, we easily find 




(2) 




1 



(3) 



where for shortness we have put 



d dL dL 
dt dx h dx h 
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(Lagrangian binomials). The dynamical equations then take the 
form 

r h = 0 (h — 1, 2, 3) ... (4) 

(known as Lagrange's form); and it is to be noted that, in virtue 
of the invariance of the left-hand side of (3), the quantities r h 
constitute a co-variant tensor, as pointed out in a similar case 
in Chapter V, p. 1 JO. It follows that the equations (4), i.e. 

are invariant (of. Chapter V, p. 1 10) with respect to the transfor- 
mations (T 8 ) which leave L invariant. 

2. Time as a fourth co-ordinate. Space-time. World lines. 

An obvious consequence of Lagrange's equations (4') is the 
identity 

d ( r 4 dL .1 dL 

1 OXi J 



Now suppose that in the interval (t 0 , t x ) the independent 
variable t is also made to undergo a variation St which is zero 
at the extremities and is otherwise arbitrary. Since the xjf* are 

dx • 

unchanged by this, while the derivatives x, — * undergo the 

increments M 

^ . . dot 

SXi== - Xi dt> 

it will at once be seen that, by ap obvious integration by parts, 
the contribution of the variation of t to 8 jL dt, namely, 

S« + L 4 - --, SXi}, 



can bo put in the form 

J>*{S?-l(*-fS*)} 

which, as we have just pointed out, is zero in consequence of 
(*'). 
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It is therefore possible, in dealing with the Hamiltonian 
equation (2), to apply exactly the same treatment to the space- 
co-ordinates x l9 x 2 , x z and the time t. 

To simplify the argument, consider the four-dimensional 
manifold F 4 corresponding to four parameters x i9 t\ the mani- 
fold, in which space and time are simultaneously represented, 
may be called space-time. 

A set of three equations 



or, in terms of kinematics, a motion, corresponds to a curve 
belonging to F 4 , and reciprocally. Such a curve is called a world 
line, it is an obvious generalization of the plane diagram (in 
which the abscissa is the time and the ordinate the space described) 
used to represent the circumstances of motion in a given trajectory. 
Adopting this expression, we can say that the integral curves of 
the equations (4/) are all those world lines of F 4 , and only those, 



for which the variation of the integral \Ldt vanishes, the ex- 



tremities being fixed. 

3. General transformations of co-ordinates in space-time. 
Simultaneity. 

The most general transformation of parameters in F 4 ob- 
viously includes three equations of the type (T 3 ), which substitute 
for the Cartesian co-ordinates y l9 y 2 , y% three independent com- 
binations of them, x v x 2 , also involving t\ and a fourth 
equation wliich substitutes for the time t a further combination 
x 0 (y v jy 2 > 0 (independent of the tliree preceding equations). 
This new parameter x 0 is sometimes called the local time, as it 
depends not only on the original time, but also on the point in 
question. A transformation (T 4 ) is thus represented by the 
formula: 



is the following. If two events are characterized by different 
values of y l9 y 29 y 3 , but the same value of t y it will in general 
happen that after the transformation is effected, not only the 
space co-ordinates x l9 x 29 x% of the two events will be different, but 



x L = Xt(t) 



(i = 1, 2, 3), 
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also the time oo-ordinates % This implies that two events which 
appear simultaneous with reference to the system y l9 y 2 , y 3 , t 
are not in general simultaneous with reference to the system 
of the x 9 s; simultaneity is therefore relative to the system of 
reference. This evidently does not happen when the first of the 
relations (T 4 ) is of the type x 0 = x 0 (t), or in particular x 0 = t 9 
so that the (T 4 ) reduces to a ( T 3 ). And it is precisely in order 
to avoid any conflict with the intuitive concept of (absolute) 
simultaneity that only transformations of the type (T 3 ) are 
considered in the classical physics. But a more acute criticism 
of this intuitive concept shows that, far from being a logical 
necessity, it has an empirical origin based on experimental 
results which can only be taken as a first approximation; it is 
therefore reasonable, in view of the speculative nature of our 
considerations, to admit the possibility of a more general con- 
ception of simultaneity. 

4. Einstein's form for Hamilton's principle. Its invariant 
character under any transformation of co-ordinates. 

So long as L is taken to be invariant, the form of the integral 
J Ldt is evidently not invariant for a transformation ( T 4 ), since 
in general dt is replaced by an expression linear in all four 
variables x. We might try to replace the base L by something 
more general; it would then be possible to reach the required 
result, but the method would be complicated and infertile, and 
the loss in simplicity both of concept and of form would be 
much greater than the gain in generality. 

But it is not difficult to arrive at a significant form which 
shall be invariant for every (T 4 ) if we regard Hamilton's principle 
as an approximate result, the degree of approximation being of 
course so high that in ordinary applications, astronomical as 
well as technical, the difference between it and the rigorous 
hypothetical principle shall be imperceptible. This will evidently 
be the case if the order of magnitude of the difference between 
the two, with respect to the values given by the ordinary theory, 
is not higher than the hundred-millionth (10~ 8 ). 

A concrete application of this criterion is as follows. Let c 
denote a constant velocity, large in comparison with the greatest 
velocity attained in the motions we propose to discuss. We 
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shall consider quantities comparable with £ = - as small quan- 

c 

tities of the first order, and we shall consider quantities of the 
second and higher orders as negligible in comparison with unity; 

we shall also suppose that the ratio — is similarly negligible. 

c 

We note that this will in fact be the case if c is comparable 
with the velocity of light, not only for ordinary problems of 
terrestrial motion, but also in celestial mechanics. In order to 
see this, we need only suppose that v is a planetary velocity and 
U the Newtonian potential which determines it, so that by a 
well-known result U (in the field of motion of the planet) is of 
the same order of magnitude as v 2 . 

We may take 30 kilometres per second, corresponding to the 
earth's motion in its orbit, as the order of magnitude of v. In 

round numbers, c =■= 300,000 km. /sec, so that wc have - = 10""* 
(approx.), and therefore 0 

- and - = 10 s (approx.). 

cr c 

We shall see farther on, in §§ 8, 9, and 16, that physical con- 
siderations lead us to take for c precisely the velocity of light. 
Since hi must vanish at the limits of integration, we have 

8f tl dt = 0, 

so that L can be replaced, as the integrand of (2), by 
c* - L = c* (l - J? - E). 

The terms — |/? 2 , — ? 9 though negligible in comparison with 
c- 

unity, are essential in order to prevent the equation of variation 
from reducing to an identity. Terms of higher order may however 
be neglected. We may therefore write 

c* - L - c^l - p* - % = cJ*ZT&—mi 
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so that, omitting the constant factor c and writing — 0 instead of 

at 2, 

v 2 9 the equation of Hamilton's principle (which, as just pointed 
out, is equivalent to 8 j(c 2 — L)dt = 0) can be replaced by 

8 jyJc*-^~2Udt = 0, 

or, putting cfe 2 = (c 2 — 217) dt* — dl*, ... (5) 
by 8jds = 0 (6) 

Since the value of dl 0 2 referred to Cartesian co-ordinates is 

8 

the ds 2 just introduced is a quaternary differential 

1 

quadratic form; it is indefinite, since for real and infinitesimal 
values of dt, dy x , dy 2 , dy a it can have both positive and negative 
values. At the same time it is to be remembered that, for the 
phenomena of motion at present tinder consideration, we luive always 
ds 2 > 0. 

To show that this is so, note that, taking out the common 

dl 2 

factor c z dt 2 and again replacing - ~ by v 2 , we can write 

at 

this proves the assertion, since the quantity in brackets is cer- 
tainly positive when the quantitative relations stipulated at the 
beginning of our argument hold. 

We may now note that if the ds 2 expressed by (5) is con- 
sidered as the square of the line element of the manifold F 4 
(which contains both space and time), then (6) represents the 
characteristic equation of geodesies of F 4 (cf. Chapter V, p. 130). 
It is true that the metric of this manifold is characterized by an 
indefinite quadratic form, but, as was pointed out on p. 142 of 
Chapter V, this does not introduce any real complication so long 
as we limit our considerations to lines wholly constituted of 
elements for which ds 2 > 0, as it is in the present case. We can 
therefore say that the proposed modification of Hamilton's 
principle imposes a metric limitation on the space-time manifold 
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F 4 » and that the mechanical problem of the motion of a free 
particle under the action of forces derived from a potential has 
been transformed — with an alteration of the laws of dynamics 
which is quantitatively very small — into the purely geometrical 
problem of the determination of the geodesies of a certain four- 
dimensional metric manifold. 

If for the arguments t, y l9 y 2 , y 3 we substitute any four inde- 
pendent combinations of them whatever, x 0 , sr^, x 2 , x 3 , by means 
of a substitution ( T 4 ), ds ?t will lose the special form (5) and assume 
the general type of a quaternary quadratic, 

s 

rf* 2 = ^ik9iu^i^k> • • • • (5') 
o 

whose ten co-efficients g ik (~ g ki ) will naturally be, in general, 
functions of the x 9 &. 

The essential point is that, ds 2 being invariant, (6) is also 
invariant for any choice whatever of co-ordinates in V 4 . This 
constitutes a marked superiority of (6) over the original form of 
Hamilton's principle. From the conceptional point of view it is 
also to be noted that this change realizes Einstein's fundamental 
concept of general relativity, which requires that it shall be possible 
to express the laws of any physical phenomenon whatever in 
a form which is invariant for every possible choice of co-ordinates, 
both of space and of time, without the time having to hold the 
privileged position assigned to it in the classical theories. 

5. Mass and energy: views suggested by the modification of 
the dynamical law. 

We shall examine in detail the form taken by the dynamical 
equations of the free material particle when the classical Lagran- 
gian function L is replaced by. the function 

which we shall write briefly in the form 

L* = — c*K y 
putting 
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Substituting — &K for L in Lagrange's equations, they 
become d d ^ _dK = Q . 



or suice 



we have also 



_ _ ft 

dy t c*K 



(7) 



Remembering that J? differs by very little from 1, we see 
that quantitatively these equations differ by very little from 
the equations (1). Considering them from the point of view of 
form, and comparing them with the cardinal equation of classical 
dynamics ,^ 

dt 

(where Q is the momentum and P the force), we see that the 
momentum per unit mass of the old theory is replaced in the 

new by the vector whose components are For a particle of 

mass m 0 and velocity V the vectorial expression for the momentum 
will therefore be 

If we wish to retain the formal property that the momentum 
is the product of the mass by the velocity, we must take as the 
mass not the constant ra 0 , an intrinsic property of the body in 
motion, but the quantity 

which will be seen to depend on the velocity and the field of 
force. Neglecting the latter so as to fix the attention in the first 
place on the motion as it depends on the velocity, we reach the 
expression 

m - m ° 

~ v. - ? 
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from yrhich it appears that m increases as the velocity increases 
and would tend to infinity if the velocity could reach the value 
c. In this sense we say that the typical velocity c, introduced 
to give an invariant form to Hamilton's principle, is a limiting 
velocity. 

We now proceed to examine the concept of energy in the 
light of relativity mechanics. 

In the classical mechanics, given a generic Lagrangian func- 
tion L(y | y) (where L does not explicitly contain the time t) 9 
the corresponding expression for the energy is 

H^X.^y^L; .... (8) 

in the case where L can be broken up into a part T(y \ y) homo- 
geneous of the second degree in the tjr's, and a part U independent 
of them, this becomes, by Euler's theorem, 

II - T{y\y) - U(y). 

It is known that T can be interpreted as the kinetic and 
— U as the potential energy. Since we have replaced the classical 
L by the expression 

£* —cs/c*-v*~2U = -c 2 #, 

we must now determine the new expression //* for the energy 
per unit mass. 

Applying (8) we get 

3 9 L* 
//* - ^—fa-L* 



substituting from equations (7) ami using the expression for K 
we get finally 

V>-S-f 



We see therefore that the energy cannot be divided into a 
part due to motion and a part due to position. Further, for 
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v = U — 0, the energy does not vanish, but remains equal 
to c 2 : a remarkable fact, the interpretation of which will be 
seen in a moment. 

Expanding the radical in series we can write 

— ('+£+?+■■■> 

and therefore, retaining only terms of the second order, 

To this degree of approximation, therefore, the energy is 
composed of a kinetic part expressed as usual by -Jv 2 , a part due 
to position which is still given by — U, and in addition a con- 
stant part (i.e. a part independent of both position and velocity) 
equal to c 2 ; this last part is called the intrinsic energy of unit 
mass. A material particle of mass m 0 (at rest or moving under 
no forces) will thus have intrinsic energy m 0 c 2 . Now considera- 
tions of a different nature lead us to assign to this intrinsic 
energy a much more profound significance than that of a mere 
additive constant of conventional value; it is in fact taken to 
represent the effective atomic and molecular energy stored up 
in the body to the extent of 25 million kilowatt-hours for every 
gramme of matter. The possibility of the existence of this enor- 
mous quantity of latent energy is shown by phenomena of 
radioactivity: a sufficient example is the fact that any small 
mass of radium is capable of giving off for years and years, 
without perceptible modification, enough heat to raise an equal 
mass of water from 0° C. to boiling-point in every hour. The 
supply of heat would last for a very long time; more than 2500 
years for radium, and for other radioactive elements a period 
comparable with geological epochs. While radioactivity is not 
a general property of all bodies, yet it demonstrates the fact that 
(at least in certain cases) matter contains an enormous store of 
energy, and in this form the assertion can be generalized so as 
to extend to every atom of ponderable matter. 

Admitting the possibility of the existence of this intrinsic 
energy, the foregoing considerations result in our assigning 
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to it tie vaJue m 0 c?. If instead we return to the expression (9) 
for the total energy, and suppose that the potential U is zero, 
we find for the total energy (kinetic and intrinsic) localized in 
a body whose mass when at rest is m 0 , the expression 



E 




and remembering the expression for the mass m as a function 
of the velocity we can also write this as 

E = mc 2 (10) 

This result shows us that there is a proportional relation not 
only between the mass of the body when at rest and the intrinsic 
energy, but, more generally, between the mass and the total 
energy localized in the body. It also suggests the hypothesis that 
to any form of energy there must be assigned a mass connected 
with it by the relation (10); and, vice versa, that every mass m 
corresponds to a quantity of energy mo 2 . This hypothesis is 
supported by other considerations, and leads to the view, of 
primary philosophical importance, that energy and matter may 
be considered as different manifestations of one single entity, 
which appears as ordinary matter when it is, so to speak, suffi- 
ciently concentrated, while it appears as energy in widely different 
forms when there are no condensation nuclei present. 



6. Einstein's form for the principle of inertia. Restricted 
relativity. 

The equations of motion in the original Newtonian form (1) 
imply, as is well known, a state of uniform motion when the forces 
are zero or, which comes to the same tiling (except for a non- 
essential constant), for U = 0. Equation (2), which is rigorously 
equivalent to (1), therefore defines states of uniform motion for 
U = 0. This property also holds for the new Einsteinian form 
(6) of Hamilton's principle, though it is not rigorously equivalent 
to equations (2). Before proving this we may point out that, for 
U =s 0, (5) gives 

ds 0 * = #dp-a 0 * (ii) 
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By a mere change of the unit of measurement of time (the 
advantage of which will be seen shortly), i.e. by putting ct = y w 
this quadratic form becomes 

ds* = dy* - dl*, 

and referring the space to orthogonal Cartesian co-ordinates, 
dsj = dy* - dy* - dy? - dy*. . . (1 1') 

This is analogous to the ordinary expression for the ds 2 of 
a Euclidean F 4 in orthogonal Cartesian co-ordinates, except for 
the signs of the co-efficients, which make it indefinite; in this 
case the index of inertia 1 is 3. Hence the V 4 with a metric of 
this kind is called pseudo- Euclidean; the system of co-ordinates 
y Qi y l9 y a , y 3 , which gives this form to ds 2 , is called pseudo- 
Cartesian or Galilean. 

It will sometimes be convenient to put the expression for a 
pseudo-Euclidean ds 2 back into the general form (5'); for this 
purpose we introduce the symbols 

(1 for * k = 0, 
- 1 f or i = k 4= 0, 
0 for i 4= k 

(the notation being similar to that introduced in the note on 
p. 55 of Chapter III), We can then say that in pseudo-Cartesian 
co-ordinates the co -efficients of ds* are 

Sat = SI*. 

We then have also, as is easily verified, 

g ik = 9* - K" (12) 

Returning to the property enunciated at the beginning of 
this section, we note that, for U — 0, the expression for L m 
becomes 

2?- -o^-itf, 

2 Tho index of inertia is the number of negative coefficients of a quadratic 
form when expressed (in any way) in the canonical form (i.e. so that it contains no 
product terms). 
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and that (6), which becomes 

Sfds 0 = 0, (6') 

can be written 

8/ L m Q dt — 0. 

The corresponding Lagrangian equations, from the fact that 
Lq does not depend explicitly on the y'a, at once give the three 
first integrals 

- = constant (i = 1,2, 3), 

whence there follows the constancy of all the jy/s (the principle 
of inertia). Now consider a particular, but very important, 
category of transformations (T 4 ) specified as follows. From 
the set of four co-ordinates (t, y l9 y 2 , y :i ) we pass to a new set 
(t 9 y l9 y 2 , y 3 ) for which the form (11) of ds 0 2 remains unchanged, 
this being understood in the sense that the transformation 
formulae are to give identically 

ds* = ^dP — ^tdy? = c2tfP — S,« » 2 
i 

The equation (6') then ensures that in the new co-ordinates 
also, interpreting t as the time and y l9 y 2 , y 3 as Cartesian co-ordinates 
the motion will appear uniform {restricted relativity). 

Transformations of this kind were effectively constructed by 
Lorentz, so that they may be called Lorontz transformations; 
we shall denote them shortly by (A) and discuss them fully 
in Section 8. Meanwhile we may indicate the characteristic 
property, pointed out by Professor Marcolongo, that, if we put 

4 

v/ — 1 ct = ?/ 4 , so that ds* takes the Euclidean form — 2 4 dyi 2 , 

4 1 

a Lorentz transformation leaves unchanged the form S^rfy,- 2 , and 

i 

thus (here too apart from any question of imaginaries) is sub- 
stantially identical with motion in a four-dimensional Euclidean 
space. 

To close these remarks on the effective existence of these 
special transformations (A) we may note an important corollary. 
Every (A), as we have said, transforms a generic uniform motion 
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into a new motion which is also uniform; but it is not possible 
to assert that the velocity is unaltered by the transformation. 
There is, however, at least one case in which this happens, namely, 
motion in which the velocity is that very large constant velocity 
c which we originally introduced in order to modify Hamilton's 
formula in a way which should be quantitatively imperceptible, 
but fertile in its results. 

In fact, for a motion in which the velocity is c (with respect 

dl 2 

to the parameters t 9 y l9 y 2 , jy 3 ), we have obviously c 2 = and 
therefore = = * *" 



In view of the invariance, not only of ds 0 2 , but also of the 

special form c 2 dt 2 — S 4 dy* which we have given it, we have, on 
1 

passing to the new variables i, y v ;y 2 , y z by a Lorentz transfor- 
mation, 

(fidP — Xidgt 2 = 0 
1 

for the transformed motion as well as for the original one, and 
therefore the velocity is c. 

7. The kinematics of rigid systems. Ordinary method of 
approach and possible variants. 

In the foregoing sections we have been led to modify (very 
slightly in ordinary conditions) the dynamics of a material 
particle P, i.e. the relation between the motion and the disturbing 
force. Nothing however has been, or need be, modified as regards 
the kinematics, i.e. the description of the phenomenon of the 
change of position of a point P with respect to an assigned 
observer S, or in other terms with respect to a Cartesian system, 
in a certain interval of time. For convenience (the reason for 
this choice will be clear in a moment) we shall denote these axes 
of reference by O y x y 2 ;y 3 , and the time by L 

The equations of motion of P, 

& = (i = 1,2,3), . . . (13) 

the velocity V as a vector of components (i — 1, 2, 3), the 
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acceleration, &c, will all be as in the ordinary case. In particular, 
there is uniform motion when V is constant, i.e. the y^s are 
linear functions of L In this case, taking one of the axes, say- 
that of the y ± % parallel to V, the equations (13) can be put in the 
simplified form 

9i = y°i + vt, y 2 = yl y z - y% . . (14) 

where v obviously denotes the velocity in the scalar sense (the 
component of V along y x ), and y^ y%, y% are the initial values 
of the co-ordinates y v y 2 , y z of the moving point. 

As is well known, there are in ordinary kinematics two ways 
of defining rigid motion and of investigating its problems; these 
are briefly as follows: 

(1 ) A rigid system is defined as a system consisting of any 
number of points P, P', . . . of co-ordinates y h y i9 . . . 
(i = 1, 2, 3), which move in such a way that their mutual 
distances apart remain unchanged; i.e. so that for any two points 
whatever of the system, P and P' 9 and for any movement of 
these points, the relation 

s 4 (&-?7:) 2 - & 

holds, the quantity d 2 on the right being constant (geometrical 
characteristics of the moving system). 

In these relations and in their differential consequences 
are summed up all the properties concerning simultaneous 
positions, velocities, &c, of the various points of the 
system. 

(2) The ground covered by the equations just given for the 
relations between pairs of points is, so to speak, divided into 
two parts, the first expressing the intrinsic circumstance (i.e. 
independent of the system of reference S) that when the 
moving system changes its position with respect to £ it keeps 
its configuration unchanged. This is equivalent to the pos- 
sibility of placing at the points P, P', . . . an observer S 
rigidly attached to the body, who can be represented as usual 
by an orthogonal trihedron Oy 1 y ft y 3 with respect to which 
the position of each separate point of the moving system remains 
unchanged. In other words, the co-ordinates y iy y', ... of 
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these points with respect to these axes attached to the body do 
not vary with the time. 

At this stage the argument usually is as follows. In order 
to determine the position, with respect to the original system of 
reference 2, of the whole moving system at a generic instant, it 
is only necessary to place the trihedron Oy x y z y z (attached to 
the body) in its proper position with respect to Oy x y 2 y 3 . 
Thus we again have to deal with transformation formulas (variable 
from moment to moment) between two systems of orthogonal Car- 
tesian axes, and therefore of the type 

y L « h«*yk + M) (*' = 1, 2, 3), . (15) 
1 

where denotes the cosine (variable with the time, if the motion 
is not one of pure translation) of the angle between the fixed axis 
Oy i and the moving axis Oy k9 antl fc(t) is a function of the time 
(linear if the motion reduces to a uniform translation). 

The proposition in italics, or the equivalent group of formula 
(15), constitutes the complement of what may be called the 
intrinsic rigidity of the body (the existence of the trihedron 
attached to the body); the combination of the two gives us once 
again the kinematics of a solid body in its classical form. 

But if we analyse this complement a little iurther, we find 
that we can modify to some extent the ordinary idea of the 
motion of a solid body without giving up either intrinsic rigidity 
or the validity of Euclidean geometry. 

We need only introduce the hypothesis (independent of both 
the geometry and the kinematics of the point) that the measures 
of the distances between the points P, P', . . . (and therefore also 
of angles) of our solid may differ according as they are made 
by an observer attached to P, P', ... or by the fixed observer 2. 
While granting that the two observers may disagree as to the 
measures, it is to be borne in mind that, by hypothesis, the 
measurements are made by each of them in accordance with a 
Euclidean metric, and that (as in the classical scheme of things) 
the rigidity of the motion must always be respected, from the 
point of view of the fixed observer £ as well as of the other. 
This requires that every distance apart of two points P, P\ 
. • . of our system must remain unchanged in time, whether the 
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distance is calculated by S or by the other. For this it is neces- 
sary and sufficient that the transformation formulae between the 
y's and the y'&, 

Vi = fi (2/i> y<» 0 (i = 1, 2, 3), . . (16) 

where the //s are a priori unknown functions, should be such as 
to make 




independent of t at every instant, whatever may be the differen- 
tials dy £ . 

We get an obvious case in which this condition is satisfied if 
we suppose that the transformation formulae are linear in the 
y/s (though not necessarily with respect to t), and in particular 
that they are of the form 

8 

Vi = 2*o,*jfr + 4M0> .... (18) 
i 

where the c's are completely arbitrary constants, subject only 
to the qualitative condition that their determinant || || does 
not vanish. It should be remembered that in the equations (15) 
the coefficients a ik were in addition direction cosines (in some 
cases variable with the time) for two sets of orthogonal 
axes. 

A transformation of the type (18) between the y's and the 
jy's is, at every instant (i.e. for any assigned value of t), linear, 
and therefore homographic, or rather affine, so that straight lines 
are transformed by it into straight lines. From our point of view, 
this means that curves which appear to the observer S to be 
straight lines are so also for the observer 2, and inversely. 

It would not be hard to show that, if we impose on a trans- 
formation (16) the double condition of making dl 2 or the right- 
hand side of (17) independent of t, and also of keeping geodesies 
unchanged, we necessarily reach, if not an affine transformation 
(18), at least the product (in the sense of a product of operators) 
of an affine transformation by a rigid motion in the ordinary 
sense of the term. By a suitable choice of the trihedra of reference, 
the passage between the y's and the y 9 s can thus be effected by 
applying in succession the two following transformations: 
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(1) An affine transformation given in its canonical form, 

1. e« by means of equations of the type 

til = hvi> = hy* y& = hvs> • (19) 

where the k's are positive constants; 

(2) a transformation (15) between the y'& and the y' 's. 

We shall not spend time on the elementary considerations 
which lead to this conclusion, and shall merely point out that the 
coefficients k of the equations (19) determine the deformation 
consequent on the affine correspondence between the y 9 s and 
the y' \s, while in the second change, from the y' 's to the ^'s, there 
is no further deformation. 

It follows from this, taking (19) into account, and considering 
the two observers S and 2, that a segment having the same direction 
as the axis Oy i9 and of length 1 with respect to the observer 2 attached 
to the body, mil appear to the observer 2 as having the length kjl; 
hence the factor is called the coefficient of elongation. The 
" elongation " of unit length is accordingly k f — 1; this represents 
an expansion or contraction according as k t > or < 1. The 
formulae (19) of course provide, more generally, information as 
to the alteration in length of segments (and therefore of vectors) 
in any direction whatever. If the coefficients a t (i — 1, 2, 3) 
are the direction cosines with respect to the axes Oy x y 2 y z of a 
generic segment and I the length as it appears to the observer 

2, we obviously get, for the length I as estimated by 2, 

Returning for a moment to the ordinary equations (15) of 
rigid motion, we shall fix our attention in particular on the most 
elementary case (which will serve as a guide and a basis of com- 
parison in the argument of the next section), that of uniform 
translatory motion. We can then take the trihedron of reference 
0$! y 2 #3 with one of its axes, say y v parallel to the direction 
(by hypothesis constant) of the velocity, and we shall take the 
trihedron Oy x y 2 y% attached to the body as coinciding with the 
fixed trihedron at the initial instant t ~- 0. 

The motion being translatory, the axes attached to the body 
remain parallel to the corresponding fixed axes throughout; and 

( 1> 065 > 11 
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if v is the velocity of translation, the formulae determining the 
motion evidently reduce to 

§ 1 = Vi + vt > V% = V2> Va = Vz- • • 

We have thus again reached the typical equations (14) for 
each point (y l9 y 2 , y 3 , constant) of the body. 

4 8, Romerian units. Study ol Lorentz transformations. 

The equations (15'), which define in the simplest form an 
ordinary uniform translation, can obviously be associated with 
the identity 

i = t. 

We thus get a quaternary transformation between (y v y 2 , y 3 , t) 
and (y l9 y 2 , y 3 , 1), which we shall denote by T. 

We next observe that the most general representation of a 
uniform translation, with arbitrary choice of the two trihedra 
(one fixed, the other attached to the body), subject to the sole 
condition that the origins coincide initially, can be reduced to T 
together with two rotations independent of t. In fact, denoting 
as before the two trihedra of reference (fixed and moving with 
the body) by S and S, we shall denote by R a rigid rotation of S 
(round the origin O) which brings its axis Oy x into the direction 
parallel to the velocity of translation. Let R' be an analogous 
rotation (round O) of the trihedron S; and R the inverse rotation. 
Then the transformation formulas between (y v y 2 , y 3 , t) and 
(y v y 2 , y z , t) are represented by the symbolic product 

RTR. 

Now consider the well-known kinematical deduction from the 
classical method of representing rigid motion, namely, that if 
we consider any velocity cu whatever with respect to S (c being 
the modulus and u the versor), this becomes cu + v with respect 
to 2, if v is the vector representing the velocity of translation of 
S with respect to S. This, as has been observed, is in contra- 
diction with the results of experiment, at least as regards the 
velocity of light, for which c in cm. per second has the par- 
ticular value 3'10 1Q , which remains unaltered, even when com- 
pounded with a uniform translation (Michelson-Morley experi- 
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merit). The wish to restore concord between theory and experi- 
ment leads us to modify the equations (15'), and with them, if 
necessary, the equation i = t, in such a way that the relation 
ds 0 2 ds 0 2 (not merely dV* = dl 2 ) shall be rigorously satisfied; 
i.e. so that there shall be an identity between two quadratic 
forms involving not only the space co-ordinates but also the 
time. 

Special transformatiwis. We propose to try to modify thefie 
transformation formulae (as usual very slightly, at least for small 
values of v) so as to make ds 0 2 invariant. For this purpose we 
shall have to replace t sometimes by the variable 

y 0 ct 

and sometimes by the imaginary variable 

y 4 = Let (l — s/ — f). 

With this change, putting also — j8, the equations (15') and 
I = 1 become either c - 

£1 = ^1+ Plto, $2 = V%> £3 = y*> §0 = Vo • ( 2( >) 

or 

$1 = Vi- <>Py* y% = y* & = y& y* = (20') 

The real variable y 0 introduced here is only the time measured 
by choosing as unit the time taken by light to traverse unit 
space. Thus the velocity of light is 1 and the dimensions of time 
become the same as those of length. The character of a primary 
magnitude ordinarily assigned to the time thus disappears, the 
unit of time being linked up with the unit of length by means of 
the phenomenon of the propagation of light. It will be convenient 
to apply the term " Romerian " 1 to measurements of time 
made in this way; we shall similarly use the term " Romerian 
velocities " (which are pure numbers) for velocities referred to 
the Romerian time ?/ 0 . It obviously follows from the equation 
y 0 = ct that a Romerian velocity is only the corresponding 
ordinary velocity divided by c; in particular, the quantity 

1 From O. ROmek (1644-1710), who was the first to discover and determine 
the velocity of light. His method was based on observation of the eclipses of 
Jupiter's satellites. 
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f} = - just introduced is only the Romerian velocity of trans* 
lation. 

In accordance with Marcoiongo's remark quoted on p. 300, 
the transformations we are in search of must leave invariant the 
differential quadratic form ds 0 2 , which we can write (introducing 
the imaginary variable y 4 ) in the form 

— ds 0 z = dy? + dy£ + dy£ + dy*. 

In order to obtain particular transformations satisfying this 
condition, we shall first consider linear homogeneous trans- 
formations. These will at once result, as we have already pointed 
out in the preceding section, in the condition (17) being satisfied, 
which interprets the transformation as equivalent to a rigid 
motion (if not in the ordinary sense, at least in the intrinsic sense 
there specified). As we are dealing with linear (and homogeneous) 
transformations, the invariance of the differential form — ds^ 
implies that of the algebraic quadratic form 

- ? = Vi + v£ + y£ + y?> 

and reciprocally. 

Starting from the equations (20') we shall examine whether 
we can reach the required result if we keep the co-ordinates 
y 2 , y 3 invariant, i.e. if we suppose 

y* = y* $z = y* 

We have thus to find a linear transformation between the 
variables (y v y 4 ) and {y v y 4 ) which will leave invariant the expres- 
sion 9 , 2 

y* + y* • 

Hence (apart from the question of imaginaries) we have to 
discuss a rigid rotation, roimd the origin of co-ordinates, in the 
plane y v y 4 , and therefore of the form 

y x ^= y x coskf> — y 4 sin<£, 
fit =--= y x &m<f> + y 4 cos<£. 

If we introduce the real variables y Q) y 0 instead of y 4 , y 4 , it 
will be seen that the necessary and sufficient condition for the 
disappearance of imaginaries from the ultimate formulae is that 
the coefficient of y x should be real and that of y 4 imaginary in 
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the first equation, and vice versa in the second. To obtain this 
result, <f> must be a pure imaginary; in fact, putting 

<f> = n/j (with tft real), 

we get 

COS^ = COS10 — COSh^r, 

sin^ = sinaft =- tsinh^r, 

where cosh0 and sinhi/r as usual denote the hyperbolic cosine and 
sine. 

Hence our transformation formulae take the form 
$1 — Vx cosher + y 0 sinh^r \ 
32= ?h I . . . . (21) 

Vz =•= y 3 

?7o — ?/i sinh^r + ?/ 0 cosh0 

If we remember that in the equations (20) the pure number 
jS is in ordinary cases fairly small, we Gee that in these cases the 
equations (21) differ quantitatively by very little from the 
equations (20), provided we suppose 0 sufficiently small for cosher 
and sinh</r not to differ by very much from 1 and 0 respectively. 

But we get a precise kinematical interpretation of the para- 
meter ifj on which the transformation (21) depends if for instance 
we fix our attention on the origin O of the moving axes, i.e. on 
the point whose co-ordinates are y x =-» y 2 = y z — 0, for a 
generic value of y 0 , this last parameter denoting the time 
(Romerian time) as it appears to the observer 2. For the fixed 
observer 2, with respect to whom y 0 represents the time (likewise 
Romerian) and y v y 2> y :i the position, we have, corresponding to 
O and a generic value of y 0 , 

y x = y 0 sinh^r, y 0 = y 0 cosher, 

while y z and y 3 vanish. Hence the motion of O is rectilinear, and 
the ratio 

= tanhiA, 

Vo 

which is obviously constant, is the (Romerian) velocity. Denoting 
this ratio by /?, we have in the equation 

tanh^ = P 



3io 
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the required kinematics! significance of the parameter 0. More 
generally, the same quantity /3 stands for the Romerian velocity 
of any other point P rigidly attached to L. In fact, if y v y 2 , y a 
are constants and y 0 a generic value, the result of differentiating 
the equations (21) is to give 

dy t = smhtp dy 0 , dy a = dy a — 0, dy 0 = conhp dy 0 , 
whence it follows that 

f J - tanh^ = 0. 

Q.E.D. 

Applying the ordinary formulae 

, . , tanh 
cosh = -. , smh = —.- , 

VI — tanh 2 x/l tanh 2 

we can put the equations (21) in the form commonly used (the 
special Lorentz transformation) 



Jl Vl-jS 2 

ft y* 
y% = «/ 3 
,t _ y 0 + Pyi 



(21') 



2/o = 



or, using the ordinary instead of the Romerian unit of time 
(i.e. i and £ instead of y 0 and with jy 0 ~~ d, y 0 = ct), 

+ ' 

7. «• 



& ■= 



c 2 



y a = y 2 
y* = % 



(21") 



It will be seen that the necessary condition for these formulae 
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to be real is /P < 1, or v < c; which once more demonstrates 
that the velocity of light c is a limiting velocity. 

It can be easily verified that the formulae obtained from 
(21') or the equivalent (21") by solving these equations with 
respect to y l9 y 2l y 3) and y 0 or t differ from the first set only by 
the change of v into — v (and therefore of j8 into — /?), and of 
course of the two sets of variables; precisely as happens for the 
equations (15') and (20) which refer to an ordinary translation. 

v 2 

If in particular we suppose /? 2 , i.e. (but not necessarily /?), 

c 

negligible in comparison with unity, the first three of the equations 
(21") reduce to the formula) (15') of the ordinary translation, 
while the fourth gives rise to an additive term denoting the 
difference of time between the two observers S and S, expressed 
by the equation 

It will be seen that the additional term ^ y x depends on the 

position of the point at which 2 has to apply his own measure- 
ments of the time; for this reason t is called the local time. It 
was associated by Lorentz with the ordinary uniform translations 
(15') with the intention of explaining to a first approximation 
(i.e. neglecting jft 2 ) the character of electromagnetic phenomena 
for bodies in motion; this requires explicitly that the relation 
dsj* — cLsq should hold to the same order of approximation. 
Later on Lorentz himself discovered the equations (21"), which 
result in the rigorous invariance of ds^. Einstein rediscovered 
them from the point of view of this invariance, which is the 
mathematical expression of his principle of relativity in its most 
elementary form. 

Let us examine the formulae (21"). They contain the best- 
known results (to some of which eminent students of relativity 
have assigned paradoxical consequences) of the kinematics of 
relativity. In the first place, the non-invariance of t, as noted 
above in § 3 in general for any (T 4 ), points to the necessity of 
abandoning the ordinary concept of simultaneity in the absolute 
sense. In fact, two instantaneous events, taking place at two 
different points of space, may correspond to the same value of 
t but not of t (a sufficient condition is that the y x 9 s should be 
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different), and may therefore be simultaneous for one observer 
who uses 2 as his system of reference and not for another who 
uses 2. Hence the time ceases to be an absolute quantity and 
becomes relative to the system of reference and connected up 
with the space co-ordinates; it is in fact local time, to use the 
term already referred to as having been introduced by Lorentz 
in his researches on the electrodynamics of bodies in motion. 

Suppose that two events take place at the same point P of 
the body (and therefore with the same y v y z> y 3 ), but not at the 
same instant, being separated by an interval of time At (measured 
in the system 2): for the observer 2 the interval will be At, and 
the relation between the two is given at once by the fourth of 
the equations (21"), noting that y ± is constant, so that 

Hence for the observer who accompanies the point P where 
the phenomena take place, the interval of time At is shorter 
than for the fixed observer 2; i.e. we haye a slowing down of 
the time with respect to 2's measure, as if the unit of measure- 
ment had become — -.— — — : _ times that used by 2. 
s/l - p* 

Similarly two events which happen at what is for 2 the same 
point (i.e. with the same y x , y 2 , y 3 ) but separated by an interval 
of time A/7, will appear to 2 to be separated by a longer interval. 
This follows at once from the fact pointed out above that the 
inverse forniulaB of (21') and (21 ") are found by changing v into 
— v and therefore f$ into — /?. 

We shall now try to determine the difference, if any, in the 
estimates of lengths made by two observers 2 and 2, each at a 
specified instant of his own time. Suppose, for instance, we wish 
to carry over to the observer 2 measurements made by 2. 
Substituting in the first three transformation formulsB of (21") 
the value of t in terms of i given by the fourth, we get 



vt 
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which may be considered as resulting from the product of the 
affine transformation 

y'i = V 1 ~ 2fe = ?/ 2 > y'z = y 9 

by the ordinary translation 

$1 = y'\ + vi, y 2 = y 2 , £ 3 = 

In this form of the equations the change in length is at once 
obvious. We need only refer to the conclusions of the preceding 
section, noting that the elongation coefficients Jc l9 k 2 , k 3 of the 
formulae (19) are in this case represented by 1 — /J 2 , 1, 1. 
Hence if the fixed observer estimates distances at a generic 
instant i, and if his results are compared with those of the observer 
attached to the moving body, who is also estimating the same 
distances at any instant whatever of his own time, then the former 
observes a contraction, in the ratio s/\ — jB 2 : 1, for longitudinal 
segments, i.e. in the direction of motion, while there is no change 
for transverse segments, i.e. perpendicular to the velocity of 
translation. 

The inverse formulae, for the change from S to 2, differ, as 
we have already said, only by the change of v into — v. Hence 
the same rules hold good; e.g. fixed segments in the direction of 
motion will appear to the moving observer as contracted in the 
ratio s/ 1 — fp \1 m comparison with the measurement of them 
made by the observer 2; and so on. 

General transformations. We propose lastly to prove a result 
analogous to one shown above to hold for the translations of 
classical kinematics, namely, that the most general Lorentz 
transformation (A) (i.e. a linear transformation between two 
sets of four variables y t and y t (i = 1, 2, 3, 4) for which the 
quadric q remains invariant) can be represented in the symbolic 
form 

R £ R, 

where R and R are ordinary orthogonal transformations (rota- 
tions) between (y v y 2 , y 3 ) and (y l9 y 2 , £ 3 ), and £ is a special 
Lorentz transformation of the type studied above. 

<D655) 11* 
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The transformation (A) will be a quaternary orthogonal 
transformation of the type 

4 

y h = Stojutjfe (22) 

i 

whose coefficients a hk constitute an orthogonal matrix, i.e. such 
that 4 

X k a hk a jk = 8>, (23) 

£ k a kh a kJ « 8> (23') 

(A,j - 1,2,3,4). 

In order that the variables y l9 y 2 , y z , y l9 y 2 , y z> may be 
real, and jy 4 , y 4 pure imaginaries, we must evidently have a hk 
(h, k < 4) real, a A4 and a 4/l (h < 4) pure imaginaries, and a 44 real. 

We shall of course interpret y v y 29 y 3 as Cartesian co-ordinates 
with respect to a trihedron K rigidly attached to S, and j/ 4 as 
the time variable; and similarly for the jy's. 

The directions of the trihedra K and K are a priori arbitrary; 
we shall now determine a rotation R for K and a rotation R 
for K such that we shall ha ve 

A - E>£R. 

To do this, we consider, with reference to K , the vector whose 
components are a 14 , a 24 , a 34 ; let i denote the relative versor. If 
we turn the trihedron K round in such a way that its axis y x 
takes the direction of I, we shall have 

a 24 =-"= a 34 = 

we shall take this as the rotation 7J. 

Now from the identities (23) and the values just given it 

follows that 3 8 

E*a 2 k = 2> k aal =1, 
l i 

3 

Q-Vc a 3fc — 0, 

i 

so that the two vectors determined (with respect to K) by the 
components a^, a Zk (k = 1, 2, 3) are of unit length and ortho- 
gonal. We shall call them j, k, and shall take as the rotation R 
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the rotation which turns the trihedron K so that its axes y a , y 3 
coincide in direction with the vectors j, k, so that we get 

a 2i = <ha — a si = a 82 = 0. 

As a result of the two rotations R and R the form of the 
matrix of the a's comes to be 



'Hi 


a 12 


a 13 


a 14 


0 


1 


0 


0 


0 


0 


1 


0 


a 41 


a 42 


a 43 


a 44 



and, from the group properties of orthogonal substitutions, this 
matrix must also correspond to a substitution of this kind (since 
it is the result of the product of the original substitution by two 
rotations). A consequence of this is the vanishing of four other 
elements of the matrix: in fact, the conditions that the first line 
of the matrix shall be orthogonal with the second and third lines 
respectively are 



*12 — a 13 



0, 



and similarly, taking the fourth line with the second and third, 
we get 



a 42 = a< 



'43 



0. 



Thus we finally get the matrix in the form 



a ll 


0 


0 


«14 


0 


1 


0 


0 


0 


0 


1 


0 


a 41 


0 


0 


<*44 



which corresponds to a transformation of the type (21), i.e. to 
a special Lorentz transformation jQ, We have thus shown 
that, through the two rotations R (for the trihedron K) and 
R (for K) f the general transformation (A) reduces to a special 
transformation jQ. 

So far we have considered only linear transformations. The 
question may be raised whether we should not get greater 
generality if this restriction were removed. In this connexion 
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we shall merely say 1 that the linear transformations studied 
here are the only ones which, in addition to retaining the 
invariance of <is 0 2 , make finite values of the y'a correspond to 
finite values of the y's, and vice versa. 

9. Relative motion. Composition of velocities. Kinematical 
justification of a formula of Fresnel's. 

In order to show the relation between the various aspects of 
a single motion — let us say specifically the motion of an assigned 
point P — with reference to two different observers S and S, it 
is only necessary to use the transformation formula? between 
the corresponding co-ordinates. This holds both in ordinary 
kinematics and in relativity kinematics, with the reminder that 
for the latter the time y 0 is among the co-ordinates affected by 
the transformation. 

Consider in particular a Lorentz translation, which, as we 
have seen in the preceding section, is defined by the formulae 
(21"), suitable choice being made in advance of the two trihedra 
which represent the observers and are denoted by S and S. 

Now suppose that the motion (which we can call relative) 
of the point P in relation to 2 is given; i.e. that the expressions 
Viilfo) ( ? * " 1> 2, 3) for its three space co-ordinates are known 
formally as functions of y 0 (the Romerian time). To obtain a 
representation of the absolute motion, i.e. the motion with 
reference to S, it is obviously sufficient to find the expressions 
for the co-ordinates y ( (i — 1, 2, 3) of the point P as functions 
of the new time variable y 0 . The transformation formulae (21') 
give the required result at once; in fact, if we insert in them for 
the y/s the expressions y i (y 0 ) belonging to the moving point P, 
all the y's become known functions of y 0 ; and if we suppose this 
parameter found from the fourth equation 

and substituted in the first three, we get the equations of absolute 
motion in their explicit form. 

1 For the proof, cf. C. Munari: "iSopra una eHpressiva interpretozione cine- 
nmtica del Principle di Relativity in Rend, delta R. Ave. dci Zdncei,Vol. XXIII 
(1914), p. 781. 
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The resulting relation between the absolute and relative 
velocities is especially interesting. The vector rule no longer 
holds that the absolute velocity = the relative velocity + the 
velocity of the moving origin (the latter, in the case of trans- 
lations, of course reduces to the velocity of translation, whatever 
may be the instantaneous position of P). The relativity com- 
position of velocities is a little more complicated. In order to 
see what happens in the clearest case, we shall consider a relative 
motion parallel to the translation v. With this hypothesis the 
co-ordinates y 2 and y z of the point P are constant, and, from 
(21'), y 2 and y% are also constant, or, in other words, the motion 
with respect to S is also in the direction of the translation. 
Differentiating the first and fourth equations of (21'), we get 

dy Q +pdy t 
*** " Vl~F' 
Putting for the sake of shortness 

a ... d Vi B _ *Vi 

so that f3 a and /J r are the velocities (scalar and Romerian) of the 
point P with respect to £ and £ respectively (absolute velocity 
and relative velocity), the foregoing formula), on dividing the 
first by the second, give 

this is what is called Einstein's law for the composition of 
velocities. Multiplying by c and remembering that y 0 = ct, 
y 0 =zz cl, we can evidently replace the Romerian velocities 

fia — j^r> Pr = t~ * P> by the corresponding ordinary velocities 
Va = If' Vr == d^' V> Write 

v * i+w; ' • • ■ (24) 
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If both the velocity v r of P (with respect to S) and the velocity 
of translation v are small in comparison with c, the denominator 
differs from unity by a term of the second order; if we neglect 
this difference we get back to the fundamental relation of ordinary 
kinematics (which may be called Galilean) 



in view of the criterion we are applying, this result was of course 
to be expected. In general the equation (24) shows that, for )8 | 
and | /3 r j less than unity, | fi a | also < 1; while, for | fi or 
| )8 r | equal to unity, | fi a | also = 1. To prove this, note that, 
whenever | £ | < 1 and | ji r | < 1, 



is always positive, so that ^SJ 2 = I p J~£\ < 1; while for 



| j8 | = 1, or | j8 r | = 1, fi a 2 = 1; which proves the required 
result. We thus find once more the limiting character of the 
velocity c of light: however near v r may be to c, provided it is 
less than c (j8 r < 1), if it is compounded with another velocity of 
translation v 9 less than c, but as nearly equal to c as we please 
(| (3 | < 1), the result will always be less than c, or in other 
words | fi a | always < 1. Vice versa, the velocity c for S remains 
c for any 2, whatever may be the velocity of the (Lorentz) 
translation with which the two observers are moving with respect 
to one another. 

Within the scale of velocities of ponderable bodies (velocities 
small compared with c), the relation (24') reduces sensibly to the 
Galilean formula v a — v r + v, as we have already said. But 
when the phenomenon of motion under consideration is the 
propagation of light in a transparent medium, so that the velocity 
has an order of magnitude comparable with that of c, then the 
divergence between the Einsteinian and the Galilean kinematics 
becomes striking, and lends itself to experimental verification. 

Einstein has in fact drawn from this a magnificent argument 
in support of the theory of relativity. He deduced logically (by 
a purely kinematical proof x ) from (24') a formula of Fresnel's 

1 Even before Einstein, Lorentz had given a theoretical justification of FresneVs 
formula, based on his celebrated electron theory of the electromagnetic phenomena 
of bodies in motion. Einstein's explanation is plainly more attractive. 



v a — v, + v\ 



(1 + OB = <l_F)(l-jV) 
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concerning the movement of light waves through transparent 
media in translatory motion; a formula which was experimentally 
confirmed for the first time by Fizeau (1851), whose experiments 
were repeated with improved methods by Michelson and Morlcy 
and by Zeeman. 

The argument is briefly as follows. In a medium of refractive 

index ja, it is known that light is propagated with velocity if 

the medium is at rest. Suppose instead that the medium has a 
velocity v in the direction of propagation of light (in the same 
or the opposite sense). Ordinary kinematics would lead us to 
expect that the velocity of propagation (with respect to the 

observer) would become - + v; Fizeau and the others, however, 
by delicate experiments on interference phenomena, found that 

the amount to be added to - (or subtracted from it) is not the 

* 1 
whole of v, but v multiplied by the coefficient (< 1) 1 — — , 

so that the velocity of propagation is V* 

IH 1 --^) <2B > 

The factor 1 — * is known as FresneVs corweetion coefficient, 
p. 

The expressions (25) are evidently not in agreement with the 

Galilean kinematics. But they are in excellent agreement with 

the Einsteinian kinematics. In fact, let us consider, to fix 

ideas, the case in which the motion of the medium is in the same 

sense as the propagation of light, so that we take the + sign in 

c 1 
(25). Then (24') holds when we put - for v r and therefore - for 

ft. Hence it gives ^ 

C - + v 

V a = 

/* 

or, neglecting terms of the second order in )8( = 
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The last term is also of the second order with respect to the 
first, so that we are left finally with Fresnel's formula 



10. Further generalization of the metric of F 4 , still coinciding 
to a first approximation with ordinary dynamics. 

We now propose to see whether it is possible to assign to the 
F 4 other metrics slightly different from that characterized by 
(5), but such that the dynamical principle underlying them is 
still equivalent, to a first approximation, to Hamilton's principle. 

We return to the general form (5') of ds 2 , and observe first 
that the particular form (5) just considered is a case of (5') 
obtained by identifying the time co-ordinate x 0 with ct and the 
space co-ordinates x l9 x 2 , x 3 with the Cartesian co-ordinates 
tfv y*> and putting 



9oo =l- * u ~, g oi = 0, g ik = - 8f (i, ft - 1, 2, 3). (26) 



If now we wish to consider a metric whose coefficients g 
differ by very little from the values (26), we can put 



<7oo = 1 — 2 <£> 9oi = — y» 9* —8 k i — Y*> ( 26 ') 



with the understanding that the quantities y (which as regards 



or higher order with respect to while *fj (which also has the 

c 

dimensions of a number) is to be considered as of at least the 
third order. 

With these values for the coefficients, and taking the variables 
Vv V& Vz *° b e co-ordinates differing very little from Cartesians, 
we can write ds 2 in the form 



ds*= (l-2^)dy 0 8 -2rfy 0 S l y < % i -S £fr (8j+y«)rfy i ^. (27) 



where 



c2 + *, 



dimensions are pure numbers) are of the second order 




1 



1 
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If we denote derivation with respect to y 0 by a dash, and put 



v* 3 / 
- = 2 v a 



3 

T 2 = i^ ik y ik y £ y k 
1 



(28) 



we shall have 



It is to be noted that since y\ — - is of the first order, 

c af 

it follows that T x is of the third and T 2 of the fourth order at 
least. 

To shorten the work, we shall introduce the quadrinomial 

r - 4, + Tl +T t +ip, 

observing that it is composed of terms of the second order, 

which can be written \ (%v 2 + U), plus terms of higher order. 
We then have c " 

d? 2 

* = i--2i\ 

and we can extract the square root, neglecting powers of T 
higher than the second (i.e. terms of order higher than the fourth). 
To this degree of approximation we get 

= i-r- £r 2 , 

i.e. rewriting c dt for dy 0 , and multiplying by c?dt y 
cds = &dl — c 2 dt{V + iF 2 ). 

Substituting this expression in the variational equation of 
dynamics 

Sjcds •= 0, 
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and remembering that 8* vanishes at the limits of integration, we 
see that this equation reduces to 

and that the corresponding Lagrangian function is therefore 

l = c*(r + in), 

or expanding, and neglecting terms of order higher than the 
fourth (in the sense just defined, i.e. ignoring the presence of the 
factor c 2 ), 

L = frfi + u + j Ti + ^ + <ty + f. (it- + (29) 

The first two terms of the expression on the right (reduced 
to zero dimensions, i.e. divided by c 2 ) are of the second order; 
they constitute the Lagrangian function of the classical mechanics, 
from which we began our investigations. 

The successive terms of (29) (reduced to zero dimensions in 
the same way) are of higher order: hence they will represent 
small corrections to be applied to the equations of motion. The 
metric (27) which we have here assumed still gives, therefore, 
to a first approximation, the same laws as are deduced from 
Hamilton's classical principle. Besides the potential U, it 
contains the ten functions ip, y h , y vk of the four variables y (position 
and time); these are small, as we agreed, and as we have repeatedly 
had to remember in making the various transformations, but 
are a priori arbitrary. We shall see farther on how the law of 
universal gravitation and a criterion provided by the tensor 
calculus lead to the determination of these ten functions (from 
ten differential equations), and so to an explanation of some 
slight divergences which have been observed between the results 
predicted by the Newtonian mechanics and the true motion of 
the heavenly bodies. This more exact correspondence between 
theory and observation provides a physical justification of 
Einstein's new method of approach, which further incontestably 
represents an enormous speculative advance through its charac- 
teristic of securing invariance for all transformations of the 
co-ordinates, not only of space, but also of time. 
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11. An important particular case. Corresponding trajectories 
and their identity with those of an ordinary mechanical problem. 

We shall now apply the expression (29) for L to a special case, 
the interest of which will be seen in next chapter (§ 8, p. 394). 
Suppose that we have 

T x =- 0, 

(either exactly, or neglecting terms of order higher than the third 
and fourth respectively), where x i g a function of the jy's of at 
least the second order. Suppose further that ifi and like £/, do 
not explicitly depend on the time. We shall meet later on a 
characteristic example in which this condition is satisfied. 
The expression (29) can now be written 

L = K(l + 2 J + x)+^+^+^(^- ~ U y. (30) 

71^,2 _ m 2 

It is to be noted that ( 1 in the last term is of the 

\ c 2 / 



fourth order, while the principal part of L is of the second order; 

x v 2 _ u 

hence, in this last term, we may calculate ---- 0 only to a first 

c~ 

approximation. But we know that to a first approximation the 
classical mechanics holds, and that therefore the integral of vis 
viva exists in the form 

\v 2 — U ~-= B 0 = constant; 

hence the last term on the right of (30) can be replaced by the 
constant £2? 0 2 /c 2 , or even suppressed, since a constant contributes 
nothing to the variational equation. 

The remaining terms of L can be separated into two groups, 
according as they do or do not depend on the velocity, by putting 

r- i^(i + 2 J +x) 

and therefore 

l = t+ a 
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This form of the Lagrangian function corresponds exactly 
to the form found in the classical mechanics (for a system with 
three degrees of freedom, if not for a material particle), if we 
consider jTas corresponding to the vis viva and U to the potential. 
Further, it is known 1 that, whenever (as in this case) T is a 

dU- 

quadratic form in the quantities y { = -p, not explicitly con- 

dt 

taining t, and U is a function only of the Lagrangian co-ordinates 
y, then the differential equations arising from 

Bj(T+ U)dt = 0 

admit of the integral (of vis viva) 

T-U= E, 

where E is a, constant (the total energy), and the trajectories 
corresponding to a given value of E are identical with the geodesies 
of a manifold such that the square of its line element is defined by 

ds* 2([f+ E)Tdt 2 . 

(Principle of Stationary Action.) Applying all this to our case 
we shall have the integral of ris viva, in the form 

^ ( x + 2 J+ x) - (v+<*l) = E > 

and, for any value of E fixed in advance, we can assert that the 
trajectories coincide with the geodesies of the manifold 

dsf - (ll + cty +- B) (l + 2 ? + X ) M*> 

where dl 0 2 — S <: dy { 2 , 

i 

or with the trajectories of the motion, in ordinary space, of a 
materia] particle with total energy zero, and acted on by forces 
derived from the potential 

tf x = (ff + cV + tf) (* + 2 J+x). 

1 Of. for example Levi-Civita and Amaldi: Lczioni di vneccanica razionaU, 
Vol, II, Chapter XI, No. 16 (Bologna, Zanichelli ; in the press); or Whittakbr : 
Analytical Dynamics 2nd edition, Chapter IX (Cambridge University Press, 1917). 
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which can also be written (neglecting constant terms and terms 
of higher order) as 

U l= = V + c^+^+Ux+B^ + xy (31) 

12. Qualitative characteristics of relativity metrics. Geodesic 
principle for the dynamics of a material particle. Stationary and, 
in particular, statical line elements. 

In accordance with the remarks at the end of § 10, the 
metric of the space-time manifold V 4 in the region round a generic 
point must be regarded in concrete cases in close connexion with 
the physical phenomena which take place in space and time, 
particularly in the neighbourhood of the point and instant con- 
sidered. The quantitative dependence will be duly established 
in next chapter. At any rate, in ordinary cases, as has been 
seen, we can never go far from a pseudo-Euclidean metric. This 
leads to the condition that in the real world of physics the metric 
of F 4 is to have the same qualitative properties as those belong- 
ing to the pseudo-Euclidean metrics. In particular, the index of 
inertia must be 3, which implies (as could be proved) that in every 
set of four orthogonal directions drawn from a generic point, 
three are spacelike (ds 2 < 0) and one is timelike (ds 2 > 0). 

By " relativity metric " we shall from now onwards mean an 
indefinite metric subject to these qualitative restrictions. 

In a F 4 with a definite metric there is no qualitative distinc- 
tion to be made between the various lines in it, while in a relativity 
F 4 , as we have already pointed out, we have at every point 
three kinds of direction, according as ds 2 <L or > or — 0, and, 
corresponding to these, three kinds of line — spacelike, timelike, 
and lines of zero length. Naturally the classification is much 
more complicated for manifolds of two or three dimensions 
immersed in a F 4 with an indefinite metric; and the same choice 
of the variables of reference (which is geometrically equivalent 
to the choice of co-ordinate hypersurfaces) would in general 
require preliminary close study of the local behaviour from this 
point of view. 

We shall avoid any discussion of this kind, and shall impose 
some limits on the arbitrariness of the choice of co-ordinates 
by taking as a model what happens in the case of a pseudo- 
Euclidean ds 2 referred to ordinary time t (or a linear function y 0 
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of t) and three space co-ordinates a^, 5e a , 0%, which are entirely 
arbitrary. Of the four co-ordinate lines one (y 0 ) will then be 
timelike and the others spacelike; further, on any hypersurface 
y 0 — constant we have 



where dl 2 is a positive definite differential quadric, so that we can 
say that a purely spacelike metric, like that of ordinary geometry, 
holds in every timelike section of the space-time. We shall 
constantly refer the relativity manifold F 4 to co-ordinates x 0 , 

x 2 , x 3 for which this qualitative property holds. 

Granting these various preliminaries we reach the following 
geodesic principle— derived from the particular cases in sections 
4 and 10 by an obvious generalization — which, in Einstein's work, 
appears as a fundamental law of the dynamics of a material particle 
in clearly specified physical conditions (i.e. for an assigned ds 2 ): 

The world lines of a generic free material particle are identical 
with the geodesies of the corresponding ds 2 , and more precisely 
with the timelike geodesies. In other words, these world lines 
satisfy the variational equation 



making at the same time ds 2 > 0. 

Among the relativity metrics special interest attaches to those 
in which it is possible to choose a system of reference such that 
the ten coefficients g tJr shall all be independent of the timelike 
parameter x 0 ; metrics of this kind are called stationary (in 
relation to the particular system of reference chosen). The 
justification for this name is obvious if it is remembered that in 
physics a phenomenon which takes place in a continuous medium, 
and is determined by a certain number of parameters which are 
functions of position and of the time (e.g. the motion of a fluid) 
is called stationary if these parameters do not depend explicitly 
on the time. 

In particular, a stationary metric will be called statical when 
the coefficients g ol (i = 1,2, 3) of the three product terms in 
dx 0 vanish, i.e. when in the expression 



= eotut. 




& = </oo+ 4 -j- S*sr ft z'i 4 . . (32) 



1 1 
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(in which dashes denote differentiation with respect to x 0 ) the 
terms of the first degree in x[ are missing. The justification for 
the name is somewhat more indirect, and will appear from the 
following considerations. 

It is known, and can in any case be verified at once, that when 
L is an even function of the x"s (as in the case we are considering) 
the Lagrangian equations (4') define a reversible motion, i.e. such 
that if P = P(t) represents the motion starting from a certain 
initial position P 0 with an initial velocity v 0 , then on changing t 
into — t (i.e. considering the motion defined by P = P( — t)) 
we have the solution corresponding to the same initial position 
and the same initial velocity but in the reverse direction. Further, 
in the classical mechanics it is known that the motion of a particle 
is reversible whenever the field of force is invariable with respect 
to the time, i.e. when the field is statical (in the ordinary sense of 
the word). Hence the application of the term statical to a relativity 
metric whose geodesies are reversible with respect to the timelike 
variable x 0 . 

In the statical case it is usual to put 

<7oo ^ V*> 9ik = 

so that (32) becomes 

L* = V*-i ikaik x' iX U .... (32') 

1 

this coefficient V 2 has an important mechanical meaning, which 
we shall now explain. 

If at a given instant the velocity of the moving point vanishes, 
i.e. if each x[ — 0 (case of initial motion starting from rest), 
we have in particular from (4') and (32') 

S*a*a£' = (i= J>2,3) 

1 OXi 

(the two dashes of course denoting double differentiation with 
respect to x 0 ); these define the quantities x" as functions of 
position. The terms on the right, 

dV 2 
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being derivatives of a single function — |T 2 , evidently constitute 
a covariant system (for any transformations whatever of the 
space co-ordinates). Hence the -X/s constitute the covariant 
components of a spacelike vector P = grad ( — \V 2 ). The 
contravariant components 

1 

of this vector, from the preceding formulae, are identical with the 
initial accelerations. Hence the vector P obviously provides the 
statical measure of the force (per unit mass) of the field (the 
initial acceleration of a free material particle, or, if preferred, 
the force per unit mass which must be overcome to maintain the 
particle at rest). 

Consider, beside the point P of co-ordinates x £ , a neighbouring 
point P' of co-ordinates x t + dx L and the (invariant) trinomial 

i 

Defining, as is natural, the virtual work of P for the displace- 
ment PP' as the product of the displacement by the ortho- 
gonal projection of the force (just as in ordinary Euclidean 
space), the preceding identity shows that constitutes 
the potential function of the force acting in the field in statical 
conditions. 

As has been seen just above, in the statical case the force in 
the field can be very simply expressed by means of the single 
coefficient g w = V 2 . In more general conditions, the whole of 
the mechanics of the point is summed up in Einstein's geodesic 
principle, or, as an alternative form, in the consequent Lagrangian 
equations (4'); an analogous argument can also be developed for 
the initial motion, and the expression for the force in the field 
(at a generic point and instant) as a function of the g 9 s deduced 
from it, but the results are by no means so simple and expressive 
as in the statical case. To put it briefly, the concepts of mass, 
force, and energy are all contained in the four-dimensional metric, 
but, at least in general, the task of distinguishing between them 
and associating them with the coefficients of ds % seems to be neither 
easy nor fruitful of further results. 
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13. Versors in a F 4 with pseudo-Euclidean metric. 

An important fact in connexion with a versor (unit vector) 
in the space-time manifold F 4 is that it can always be made to 
correspond to a vector in three dimensions. This follows from the 
fact that it has four parameters (or moments), only three of which 
are independent, in virtue of the quadratic identity expressing 
that the length of the vector is unity (cf. Chap. V, p. 91). The 
interpretation of a vector of this kind as a velocity gives par- 
ticularly interesting results. 

Let us consider— limiting the case to a pseudo-Euclidean 
F 4 — a generic motion defining y l9 y 2 , y 3 as functions of y 0 , and 
giving rise to a world line in F 4 . If, as we shall first suppose, 
the velocity of the motion < c, we shall get a timelike line, in 
which the corresponding 

ds* - dy* - i t dy? = dyf (J - jP) 

1 

is positive. If, on the other hand, the velocity > c (i.e. j8 > 1), 
ds 2 is negative, and we shall have a spacelike versor (cf. Chap. V, 

p. 142. In either case, denoting as usual the components ^ of 

dy 0 

the Romerian velocity by /3 t and the direction cosines of this 

velocity by a, = ^-L we obviously get the expressions 
P 



£0 = d Vo 



ds 0 \ s/l—"pt 

for the parameters of the world line (i.e. of the versor § tangential 
to it). 

Given the three components of an ordinary vector (3, these 
formula determine the four parameters of a four-dimensional 
unit vector (versor) and vice versa. Given the versor (in 
the ordinary sense) a belonging to it is of course fixed without 
ambiguity (provided p =4= 0). It will sometimes be convenient 
to describe a as the versor reduced from the four-dimensional 
versor %. For p 0 the versor % has its components g 1 , i 3 
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all zero, and is accordingly called purely timelike. If instead we 
consider the case of a very large velocity in a direction a (i.e. 
if we make j8 tend to infinity, while the ratios between the /J^'s 
remain determinate), then we have f 0 = 0, while the other 
components f 4 reduce to the direction cosines a 1 of the reduced 
versor. In this case the four-dimensional versor % is called purely 
spacelike; it is tangential to the three-dimensional manifold 
(space) Xq — constant, or rather coincides with the versor a 
belonging to this manifold. 

All this can easily be extended to the case of a y 4 of any 
metric whatever, referred to any co-ordinates x 0 , x 1> x 2 , the 
first timelike and the other three spacelike, and characterized 
by the form 

3 

ds* = Jl lk g £k dx i dx k r-= dx£L\ 
o 

where I? denotes, as in § 12, the expression 



and, as usual, /? 



0 

dl 
dx 0 * 



Given a generic versor of parameters 



we shall have 



to doc$ _ 1 

g ' \ds ] L' 

pi d x i = ._ dx, dx^ _ _ /? rf^. 

^ | <Zs I d:r 0 | | i L dl' 

14. Digression on geodesies of zero length. 

Let t denote a parameter of any kind such that the co-ordinates 
x can be considered functions of it, and put 



(dots denoting differentiation with respect to r). Consider the 
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equations of motion of a material system as summed up in the 
variational equation 

sj2Tdr = 0 (33) 

We know from ordinary mechanics that if r denotes the time 
and T the vis viva of a material system, then the Lagrangian 
equations implicit in (33), i.e. 

d dT _ dT _ 0 _ 0 % 

ar ox t dx i 

define the spontaneous motion of the system and have as a first 
integral the equation 

T = E = constant. 

Whenever the value of the constant E is different from zero, 
then by using the equation T = E it is easy to eliminate the 
parameter r from (33) and obtain from it a variational equation 
capable of defining the trajectories. We have in fact, from the 
definition of T, 

\Z2Tdr = ds, 

so that the expression for the action, i.e. the integral f2Tdr 
which occurs in (33), can be written 

f%/2E J : 2Tdr = \Z2Ej%/2Tdr s/2E[ds. 

Hence, for E 4= 0, the variational equation (33), by elimination 
of the parameter t, gives the equation 

8[ds == 0, (35) 

which is the characteristic equation of the geodesies in the F 4 
whose line element is ds. From this equation we can deduce, as 
in § 24, p. 131, the differential equations 

*i + 2y{jl f = 0 . . . . (36) 



of the geodesies, where dots denote differentiation with respect 
to $. The same equations would also be obtained, but with r 
instead of s, by writing out (34) in full and solving for the x'a. 
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To sum up, for E 4= 0, it is a matter of indifference whether we 
define the geodesies of F 4 as trajectories derived from the varia- 
tional equation (33), or by means of the typical property (35). 

We now propose to examine separately the case E = 0; 
since T — E and ds 2 = 2T<Jt 2 , this is equivalent to d$ = 0 
along the whole of the line in question, which therefore in this 
case takes the name of geodesic of zero length. (Such lines are of 
course real only if ds 2 is an indefinite form.) In this case (35) 
is no longer suitable for defining geodesies; the method just 
referred to and used in § 24, p. 131, to obtain the differential 
equations also breaks down, since it assumes s as the independent 
variable, and therefore excludes the possibility of ds being identi- 
cally = 0. The equations (34), however, keep their significance, 
and therefore offer a means of defining geodesies of zero length 
by a process of passing to the limit (in conditions of complete 
analytical regularity) from ordinary geodesies. We shall thus 
apply the term " geodesies of zero length " to the lines represented 
by solutions of the Lagrangian system (3i) for the value zero of 
the constant E. 

The differential equations (3G) of ordinary geodesies give 
x 09 x l9 x 2 , x 3 directly as functions of a parameter t (or in particular 
of s). We can suppose the parameter eliminated after integration, 
giving, for example, x l9 x 2 , x 3 as functions of x 0 . But it is also 
possible to eliminate the parameter beforehand, by obtaining 
from (35) three differential equations which define x x , x 2 < x s 
as functions of x 0 . To do this, we introduce x 0 as the independent 
variable in (35), so that it takes the form 



From (35') we deduce, by the ordinary method, the three 
required Lagrangian equations, which are 




(35') 



where, as in § 12, we have put 

3 8 



1 1 



d dL dL 



(i == 1,2,3). 



dx 0 dx\ dxi 



These are completely equivalent to (35'), since, as was seen 
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in § 2, the fourth equation (to be obtained by making x 0 vary 
and equating to zero the coefficient of Sx 0 ) is a necessary conse- 
quence of these three. 

These equations, like (35) above, lose their significance in 
the case of geodesies of zero length (E = 0). An analogous 
reduction can be found in this case too, but it is preferable to 
follow another method and leave the pure Lagrangian form. 
This method is as follows. 

We start from (33) instead of (35), and note that from the 
definition of T and L we have obviously 

ar 2 

Suppose that x 0 is not constant along the geodesic (or arc of 
geodesic) of zero length under consideration. 1 In the integral 
(33) (corresponding to a generic geodesic of the kind in question) 
we can then assume x 0 instead of t as the independent variable, 
so that 

8[L* d pdx n = 0. 
j dr 



The parameter r is a function, a priori unknown, of x 0 , such 

dx 

that — remains finite and not zero. We can therefore put 
dr 

dx 0 A(x 0 ) 

where A and A are also finite and not zero. Then the preceding 
variational formula becomes 

sf&Adxo = 0, 

from which we get for the geodesies the equations 

dx 0 dx t OX, 

1 From the limitations introduced in § 12, this condition can always be satisfied 
in the real field. In fact, if we put dxo = 0 in ds 2 , there remains a definite 
negative form, which cannot vanish along an actual line, i»e. when dx if dx* dx^ 
do not vanish simultaneously. 
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expanding and dividing by A, these can be written in the form 

d_ d& _dtf = x dL* 
dx 0 dx'i dxi dx'i 

The parameter A can be at once eliminated from these. 
Denoting for shortness the Lagrangian binomial on the left-hand 
side by r i9 we get finally the two equations 

T i _ T 2 „ T 3 

3L 2 az 2 az 2 

9a?! SiPo 9a% 

which are to be taken together with the equation 

TJ = 0. 

15. Some elementary theorems of geometrical optics. 

It is known that in a transparent homogeneous medium light 
is propagated in a straight line with constant velocity if no dis- 
turbing influence is at work. In the case of an isotropic medium 
—the only one we shall consider— the velocity is always the same 
in all directions and therefore is a constant characteristic of the 
medium. In vacuo (cf. § 4) the velocity is, in round numbers, 

c ----- 3 X 10 10 cm. /sec. 

or 300,000 kilometres per second. 

If instead we have a heterogeneous medium, in which the 
refractive index /jl (which is defined as the reciprocal of the velocity 
of propagation) varies from point to point, then the rays are in 
general not rectilinear but are bent in accordance with a law 
which depends on the way in which fx varies, i.e. on the function 
fx(x 9 y, z). This law can be put in a compact and useful foTm 
in the following way. 1 If the initial point P 0 and the final point 
P t of the path of a ray of light are fixed, the time taken by the 
ray to go from P Q to P 1 along a line s will obviously be expressed 
by the integral 

since ft, as we have just said, is the reciprocal of the velocity. 

1 Of* for example Levi-Civita and Amaliw : op. cit, Chap. XI, No. 18. 
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Now the line actually followed by the light is the one which 
makes this integral a minimum, and therefore satisfies the 
condition 

8t = 0. 

This variational equation, which sums up the whole of 
geometrical optics, is known as Format's principle. 

16. Geometrical optics according to Einstein and the meaning 
ot the constant c. 

In constructing a geometrical scheme to represent light rays 
the existence is assumed of an absolute frame of reference, exactly 
as is done in the Newtonian mechanics. In order to help the 
imagination, the system of reference is supposed to be provided 
by a hypothetical medium at rest— the so-called cosmic ether — 
which constitutes as it were a background or support for all 
optical phenomena. In space free from ponderable matter light 
is propagated in a straight line with constant velocity c with 
respect to the ether, or, which is the same thing, with respect to 
fixed axes, where " fixed " axes mean axes at rest with respect 
to the ether. Hence c is the velocity of light as it appears to a 
generic observer O, at rest with respect to the ether. 

Consider a solid C moving with velocity u (a pure translatory 
motion) and a pencil of parallel rays of light which are being 
propagated in the same sense as the motion of C. 

With respect to the observer O, the luminous phenomenon 
is diagrammatically represented, as we have just noted, as a 
particular uniform motion with velocity c. 

According to ordinary kinematics, the analogous velocity 
with respect to an observer O' rigidly attached to C is c — u. 

Now within the range of velocities which can be realized by 

material bodies the ratio -, and still more its square ~ (only 

c cr 

the latter of which can be submitted to effective experimental 
control) are small; we can, however, take it as definitely estab- 
lished that the velocity of propagation is still c with respect 
to O' also. This follows from the classical Michelson-Morley 
experiment, subsequently repeated by other physicists, and 
recently on new bases by Professor Majorana. 

In order to explain this experimental result, it is evidently 
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sufficient that the phenomenon which appears to macroscopic 
methods of measurement as the translation of a body C with 
velocity u. should, with more refined methods of measurement, 
be a transformation (A). The study of these transformations has 
in fact shown that any ordinary uniform translation is almost 
indistinguishable from a (A), the difference being of the order 

of one ten-millionth, provided that - < 10 ~ 4 . 

c 

The classical laws of geometrical optics (that the propagation 
of light is rectilinear, uniform, and with velocity c), and the 
famous experiments referred to above, will therefore still hold 
if we suppose that for the propagation of light, as for the motion 
of a material particle under no forces, the equation 

hjds 0 =-= 0 

holds, with the condition 

ds* 0 

(equations of uniform motion with velocity c); and if, on the 
other hand, we consider the phenomenon of the translation of 
solid bodies as very slightly different from the description of 
ordinary kinematics, so that it corresponds to a transformation 

Hence these special kinds of motion which correspond to 
the propagation of light in the ether, in the absence of disturbing 
influences, are dependent on the form 

ds* - c 2 ^ 2 - dl 0 2 , . . . . (37) 

in which the constant c has a specific numerical value. 

For ordinary motion, with velocities which are at most 
planetary, and under the action of conservative forces— e.g. in 
the presence of assigned masses — the same part is played by the 
form 

ds* = (c 2 - 2U) dt* - dl* . . . (37') 

in which on the one hand the constant c is subject only to the 
qualitative restriction of being sufficiently large, and on the 
other the influence of the masses modifies to some extent the 
coefficient of dt 2 . If we aim at attaining unity of conception of 
physical phenomena, we shall obviously be constrained, cceteris 
paribus, to adopt a single differential form ds 2 as the determining 
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form both for the motion of material particles and for the behaviour 
of light fays, serving as a basis for both cases. We must therefore 
assign to the constant c, in the general dynamical case, the same 
specific value as belongs to it in the particular optical pheno- 
menon. In the absence of disturbing influences, in particular of 
masses at a perceptible distance, so that U = 0, the ds 2 of 
mechanics then becomes identical with the ds 2 of optics (the 
limiting case). 

Further, since in the case U = 0 (i.e. in the absence of 
masses at a perceptible distance) the intervention of ds 2 has led 
to geometrical optics being summarized in two laws which appear 
as limiting cases of dynamical laws, we are led to hope for the 
extension of the same criterion also to the case in which masses 
exist (U =K 0). 

The propagation of light will therefore be governed in any 
case by the following postulates: 

(1) The geodesic principle (as for material motion), 



(2) ds 2 ==- 0, which is equivalent to saying that the motions 



The velocity V is thus slightly less than c; neglecting terms 
which are in fact absolutely negligible, it is given by 



These two postulates can be summed up in a single illuminating 
geometrical assertion: 

In the metric we have assigned to V 4 , the world lines of light 



It is to be noted that this assertion has an invariant form, 
and is therefore suitable for defining the behaviour of light rays, 
even if these are referred to a system of any co-ordinates x 0y 
x l9 x 2 , #3 whatever, instead of to the particular system t, y v y 29 y 3 . 
The assertion lends itself to an obvious generalization, since it 




(38) 






are geodesies of zero length. 



C D 666 ) 
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is natural to extend its scope so that it shall continue to hold 
even when the ds 2 which characterizes the metric of the F 4 , 
though satisfying the qualitative restrictions of § 12, is not 
reducible to the particular form (37). 

17, Interpretation in geometrical optics of the condition 

<& 2 = 0. 

Given any direction in the four-dimensional space (t, x x , 
%2> x z)* i- e - ail 7 system of increments (dt, dx l9 dx 2 , dx s ), we can 
obviously make a vector (velocity) v correspond to it in the 
physical space whose line element is given by 

3 a 
dl 2 = — Ji tt g tk dx i dx k ^ Jl^a^dx^x^ . (39) 
1 1 

or more precisely in the Euclidean space tangential to the given 
space at the generic point from which the specified increments 
are drawn. 

We shall take the ratios 

= ** {i = h 2 ' 3) 

for the contravariant system of this vector with respect to the 
metric (39). Writing these in the form 

dx t dl 
dl dt' 

dx ' 

we see from the presence of the factor - \ which is the direction 

dl 

parameter, that the positive factor ^ measures the length of the 

dt 

vector. Referring back to the equation (39), we have for the square 
of this length 

2 dl* * 

Another vector w, a function solely of position and time 
(of position alone in stationary conditions), can be made to corre- 
spond to the set of three coefficients g oh which are covariant with 
respect to any transformations whatever of the space co-ordinates 
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alone, by taking these three quantities for the covariant system 
of the vector. Then, denoting as usual the coefficients of the form 
reciprocal to (39) by a** and putting 

3 

w 2 = X ik a ik g oi9ok> 
i 

we get w for the length and (for w > 0) the ratios ^2* for the 

- w 

moments (the system reciprocal to the parameters) of the 
direction of this vector* It is to be noted that if the spacelike 
co-ordinates x have the dimensions of a length, the coefficients 
a ik of dl 2 , and therefore their reciprocals a l,c , are pure numbers, 
while the coefficients g {)l of the product terms in t have the dimen- 
sions of a velocity. Hence the vector w, like v, can be interpreted 
as a velocity. It will be obvious that this conclusion still holds 
even if the dimensions of the co-ordinates x* 9 x 9 * are left 
indeterminate. 

If <f> denotes the angle between v and w, both for the moment 
supposed not zero, we have for the metric (39) 

cos^ - 5, 

1 W V 

and therefore identically 

3 

vw cos<j6 ' = E, g oi x i9 . . . . (40) 

which holds even if v ot w vanishes. 

Using (39) and (40), the expression for ds 2 can now be written 
in the form 

ds 2 = dt 2 (V 2 + 2w cos0 — v 2 ), 

putting V 2 = .^oo. 

This makes it evident that the condition ds 2 ~» 0, charac- 
teristic of the propagation of light, defines its velocity v as a 
function of the position and direction of the ray, as well as of 
the time, in the general case in which the coefficients of ds 2 , and 
with them V, w, and <£, depend on L 

Representing the ratios - and (both positive and pure 

numbers) by /J and p 9 we have for fi the equation of the second 
degree 

jS 2 — 2p cos^ yS — 1 = 0; . . . (41) 
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the product of the roots being — 1, it follows that one is positive 
and the other negative. By definition v is necessarily positive, 
so that it is uniquely determined by (41). 

When all the product terms in dt vanish (the statical case), 
w == 0; hence /3 — 1, and v coincides with F. In general 
p > 0, and the difference between v and V (for a specified posi- 
tion and time) depends on the direction of the ray, i.e. on the 
angle <f> which it makes with w. We also have v — V for every 
ray perpendicular to w. It is obvious from (41) that the maximum 
and minimum values of /? correspond to <f> — 0 and <f> = tt. 
This is equivalent to saying that the maximum velocity of pro- 
pagation 

V(s/l+~fi + p) 

is along w, and the minimum velocity 

V(s/i + p 2 — p) 

is in the same direction but in the opposite sense. 

Except in the statical case, it will be seen that the propagation 
of light in physical space is not only non-symmetrical for opposite 
senses but is completely irreversible. 

18. Fermat's principle in stationary relativity metrics. 

We saw in § 14 how the difficulty involved in the variational 
principle S jds — 0 for ds 2 — 0 can be evaded in finding the 
explicit form of the differential equations of the propagation of 
light. It is not without interest to note that for every stationary 
ds 2 the behaviour of the light rays can also be defined by 
associating Fermat's principle of the minimum time with the 
equation ds 2 = 0, i.e. by assuming 

8{dx Q - 0, (42) 

with the condition that dx 0 is to be connected with x 09 the space 
co-ordinates, and their differentials by ds 2 = 0. Naturally 
while in the four-dimensional geodesic principle expressed by (38) 
not only dx l9 dx* z , dx$ but also dx 0 are to be zero at the ex- 
tremities of the interval of integration, in (42) this condition 
must not apply to rf# 0 , as it would reduce (42) to a mere identity. 
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We now propose to establish the equivalence, for every 
stationary metric, of the two principles of geometrical optics: 
(a) the four-dimensional geodesic principle, and (6) the principle 
of minimum time. 

To do this we must consider the geodesies of zero length as 
derived by the method of limits from timelike geodesies (d$P > 0). 
For the latter we put as usual 



x[ =p (» = 1, 2, 3) 

r = J— 2 = S»«tt** X k 
dX n 1 



ds^ " t ~ ft 

L 2 = = V 2 + 2 S,flr 0t x t — S*a ift Xi x k 



dx, 



(43) 



where the function L has finite partial derivatives, since ds 2 9 
and therefore L, is not to vanish. 
The equation (38) can be written 



8 Jl dx 0 



0. 



(44) 



Taking the variation with respect to the co-ordinates x l , x 2 , x&, 
we get by the classical procedure the Lagrangian equations 



d dL dL 

dx 0 dx' ( dxi 



0 (i = 1, 2, 3); . . (45) 

while the variation with respect to x 0 gives 

d /» dL , T \ dL n 
— (2 t — 7 x i — L) + — = 0, 

which is a necessary consequence of the equations (45). 

On the hypothesis, characteristic of the stationary case, that 
L does not explicitly contain x 0> we get the integral 



L-il L f x: = e, 

1 OX: 



(46) 



where the constant E represents the total energy of the moving 
point. 
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Multiplying by L, the left-hand side may be written in the 

form iz»+i(tf-if!A 

\ i dx { / 

It follows from the third of the equations (43) that L 2 is a 
polynomial of the second degree in x' l9 x %y x' s ; it is there already- 
divided up into three homogeneous sets of terms of degree 0, 1, 2 
respectively. By Euler's theorem on homogeneous functions, 

3 J)£2 , 

the linear term disappears from the difference L 2 — S £ — 7 x u 

i dxi 

which reduces to V 2 + jS 2 . Hence (46) multiplied by L gives 
\L 2 +\{V* + fP) = EL. 

The left - hand side is essentially positive when L tends 
to zero, being in fact, for L — 0, > I V z (which is to be 
taken as having a lower limit which is not zero in the field 
considered). The product EL can therefore be considered as a 
function of the x\s and x'\v which is always regular, and not zero, 
when L tends to zero; in the latter hypothesis the constant E 
obviously tends to infinity. 

Further, for all motions with the same total energy E, the 
principle (44), in which we suppose that 8x 0 vanishes at the 
extremities of the interval of integration, can be replaced by an 
analogous one which has the advantage over the first of not 
requiring this condition to be satisfied. In fact, for 8x 0 zero at 

the extremities, we have bjdx 0 — 0, and in consequence (44) 
is equivalent to 

sf(L — E)dx 0 = 0 

or, for E 4= 0, to 



8/(l-|)^=0; 



and in this last equation we can drop the condition that 8# 0 
vanishes at the extremities, since if we transfer the 8 under the 
integral sign and apply it to dx 0 (both explicit, and implicit in 
the x"&) we get 

which vanishes in virtue of (46). 
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It is therefore established that, for an assigned non-zero 
value of E t the equations of motion can be expressed by means 
of the formula 

*/(* ~ *) = ° • • • ' < 47 > 
without the necessity of imposing any condition as to 8x 0 . 

L 2 

The function under the integral sign can be written 1 — — 

EL 

from which it appears, remembering what was said above about 
the behaviour of EL, that this function is regular and tends to 
unity if L tends to zero. Now this is precisely the hypothesis 
which corresponds to the transition from material motion to the 
limiting case of the propagation of light. Since the function is 

regular, the order of the operations S J and passage to the limit 
may be interchanged, so that (47) gives Format's principle 

h[dx 0 = 0. 

Permat's principle can be put in a purely geometrical form, 
referred to the spacelike metric with line element dl, if we give 
dx 0 the value found from ds 2 = 0 in terms of x 0 , x v x 2 , x Zy dx v 
dx 2 , dx 3 , and insert this in the formula just above. The result 
is particularly easy to interpret in the statical case (g 0i = 0, 

i — 1, 2, 3), in which we have evidently dx 0 = - , and Fermat's 
principle takes the form 

./* - o. 

This shows that the light rays coincide with the geodesies 
of the three-dimensional space with line element ^; alternatively, 

referring to the physical space dl 2 and again applying the theorem 
of least action (cf. § 11), we can say that they coincide with a 

pencil of trajectories corresponding to the potential ^ 2 and 
total energy 0, 
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19. The stress tensor and its divergence in the classical 
theory. 

Let there be given a continuous medium, and in it a surface 
element (facet) dcr; one side of this facet is supposed chosen as 
the positive side, and one sense of the normal direction is associated 
with it. We shall agree that this sense is the one which corre-r 
sponds to the passage from the negative to the positive side, and 
shall denote its versor by n. The resultant of the molecular 
actions which the particles on the negative side of the element 
exert on those on the positive side is ordinarily called the stress 3 
relative to the positive side of the element considered. 2 In normal 
cases— the only ones we propose to consider — this resultant is 
of the same order of magnitude as da y and is represented by * n efcr, 
where is the specific stress on the positive side of the surface 
element normal to n. 

Referred to orthogonal Cartesian axes Oy 1 y 2 y H9 the three com- 
ponents of the vector * fl will obviously be denoted by *„ • (i =-= 1 , 
2, 3). To characterize the distribution of the stresses at a single 
point P, we introduce the three stresses ♦ 1 , «£ 2 , * 3 which act on 
the facets at P parallel to the co-ordinate planes, or, more pre- 
cisely, the facets whose normal versors are in the positive directions 
of the co-ordinate axes. Their components are denoted in order by 







3>i 3 ; 




#22. 


^23 ; 




®32> 


$33 ; 



it follows from the postulates of ordinary mechanics that the 
matrix formed by these terms is symmetrical or that 

<E>32 = *13 = *31> ^21 = *12> 

so that there are really six of these quantities <f> £Jt (i y k = 1, 2, 3). 

. 1 Of., for example, A. E. II. Lovr, Mathematical Theory of Elasticity, third 
edition, Chap. II ; Cambridge University Press, 1920. 

a Some authors, Love in particular, invert the respective roles of the two sides 
of the facet in their definitions, and therefore, by the principle of reaction, change 
the sense of the vector described as the stress. The sign of its components will 
be changed accordingly, and the inequality will be inverted which determines 
whether a given stress is of the nature of a pressure or a pull with respect to the 
element considered. 
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Putting »* for the components of the versor n (its direction 
cosines) we get the fundamental formula 

*„ = S***n*, (48) 

1 

and hence for the three components in the direction of the 
co-ordinate axes 

3 

If % is a generic direction of direction cosines g l 9 the scalar 
product * u X i.e. the component of the stress * w along 
can naturally be written in the form 

1 

From the symmetry of the O^'s, it follows that in the sum 
just written down <t) lk can be replaced by <S> H \ the sum is there- 
fore, by (48), equivalent to the scalar product * f X n. Hence we 
have the relation of reciprocity, expressed by the equation 

*i. X \ - X n. 

For \ n we have in particular what is called the normal 
stress, i.e. the component along the normal to the facet of the 
stress with respect to the facet itself. In accordance with the 
conventions we have adopted, the stress will be of the nature of 
a push or a pull according as this normal component is positive 
or negative. From the remarks above, the necessary criterion 
is provided by the sign (for £ — n) of the expression 

i 

To make the notation uniform, we shall write %' instead of n. 
Consider the bilinear form 

<D - (49) 

1 

which represents either the component along % of the specific 
stress on the facet normal to or the component along §' of 
the specific stress on the facet normal to 
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If now we replace the y's by any curvilinear co-ordinates x 
whatever (the geometrical nature of the space characterized by 

S t <Zy/" being of course regarded as invariant), then the parameters 
i 

£*, of the directions constitute, as we know, two contra- 
variant systems, reducing to the direction cosines in Cartesian 
co-ordinates, while the scalar quantity O just defined will behave 
as an invariant on account of its intrinsic meaning. It follows 
(cf. Chapter IV, p. 70) that the coefficients of the bilinear form <D 
(referred to these parameters as arguments) will constitute a 
symmetrical covariant double system which is called the stress 
teri'Sor. Extending the notation adopted in the case of Cartesian 
co-ordinates we shall denote it by 0 /fr . This tensor will of course 
have the contravariant components <l>' /f == O*' and the mixed 
components <D*, which can be obtained in the ordinary way by 
composition with the coefficients of the fundamental form. 

The stress tensor depends in general on the position of the 
point considered; the components O^, referred to generic co- 
ordinates x, can therefore in any case be thought of as functions 
of the co-ordinates, and therefore as having derivatives — 
ordinary, covariant, and contravariant. As we saw in Chapter 
VI, p. 153, from a given double tensor X t1c we can always obtain 
a vector Y intrinsically related to it, which we called its 
divergence, and whose covariant components are defined for 
n — 3 by J{ 

Now the divergence of the stress tensor has an important 
mechanical interpretation, which can be found at once by using 
Cartesian co-ordinates. We know in fact that the molecular forces 
applied to a given particle by all the surrounding particles have 
for their resultant a vector x> whose components per unit 
volume, in orthogonal Cartesian co-ordinates, are given by 

(5i) 



Noting that in this system of reference the divergence of 
fe expressed by precisely the sum on the right of (51), and remem- 
bering that the covariant components of a vector are identical 
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in this case with the ordinary components, we see at once that 
the vector x divergence of the stress tensor with its sign 

changed. Applying the formula (50) we can therefore write 

Xi = -S„a w (51') 



20. The fundamental equations of the mechanics of continuous 
systems* referred to fixed axes; transformations of them in general 
co-ordinates (space co-ordinates). 

It is known that, when no hypothesis is made as to the nature 
of the medium, and when therefore the stresses are not particu- 
larized, the fundamental equations of the mechanics of a con- 
tinuous system reduce to the dynamical equation 

pi - P F + X (52) 

(where p is the density, f the acceleration, P the force per unit 
mass, and x the vector defined in the preceding section), together 
with the equation of continuity 

\ P t + <Kv(pv) = 0 .... (53) 

(v being the velocity), which can also be written 

d £ + pdiv(v) = 0, . . . . (530 

where the symbol denotes a " proper " derivative, i.e. one 

which considers p as depending on ' in such a way that as t varies 
p refers always to one and the same particle of matter. 

If now we wish to find the explicit form of these two equations 
with reference to any co-ordinates x whatever, connected with 
the y's by formulae which do not involve the time, all we need 
do is to obtain the expressions for the covariant (or contravariant) 
components of the vector f, since those of x are already known 
from the preceding section (cf. formula (51')) and the invariant 
expression of div(pv) is known from p. 153, Chapter VI; the force 
P will naturally be supposed given by means of its covariant (or 
contravariant) components. 



348 ABSOLUTE DIFFERENTIAL CALCULUS 



The acceleration f is defined by 

dt 

where the (proper) derivative is supposed to be calculated with 
respect to an observer (system of axes or, more generally, co- 
ordinate net) fixed in the mechanical sense of the word. 

Referred to co-ordinates y this relation is equivalent to the 
three scalar relations 

& = y! = a; + L * a v * = lj 2 > A )- • < 54 > 

(i£ i dy^ 

If now, with reference to any co-ordinates x whatever con- 
nected with the y's by relations which do not involve the time, 
we consider the simple system 

^ + (54') 

it is easy to see that this is covariant. In fact, on the one hand 

S v • 

the quantities (t being a parameter not involved in the trans- 

ot 

formations) are covariant like the #/s, and on the other the 

3 

quantities H k v^ k v k are covariant from the law of contraction 

of tensors. Noting once more that in orthogonal Cartesian 
co-ordinates the covariant derivatives reduce to the ordinary 
derivatives, and also the covariant and contravariant components 
of a vector to the ordinary components, we see that in these co- 
ordinates the expressions (54*) are identical with those on the 
right of (54), i.e. with the (covariant) components of f. This 
identity will still hold with reference to the as's, and we can 
write ~ s 



dt 



i 



We can now find the explicit form of the equations (52) and 
(53) with reference to the co-ordinates #. The first will give the 
covariant equations 

H^f^ ) = ^"f w/ ^ ■ (55) 
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and the second the invariant equation 



0, 



(56) 



or 



0. 



(56') 



21. Galilean systems of reference. 

Among the purely spacelike transformations a particularly 
simple group consists of those which give the change from a 
system of fixed (in the mechanical sense of the word) Cartesian 
axes to a system of Cartesian axes in uniform translatory motion 
with respect to the first set; the latter system is called Oalilean. 
The definitions of force, specific stress on a generic surface element, 
and divergence (whether of a vector or a tensor) are not changed 
in a transformation of this kind, but the velocity v of a generic 
point is altered by the addition of a constant quantity represented 
by the velocity of translation t; this addition, however, evidently 
does not alter the acceleration (i.e. the proper derivative of v). 
It follows that such a transformation leaves unchanged the 
dynamical equation (52), and also the equation of continuity; 
the latter is evident from the form (53'), which, in addition to 
div(v) (which, as just pointed out, is invariant), contains the 
proper derivative of p, which from its intrinsic meaning is obviously 
independent of the axes of reference. 

Furthermore, all the laws of the classical mechanics are known 
to be unaltered if the axes of reference are supposed to be in 
uniform translatory motion. 

22. Equivalent form for the system (52) and (53). 

In the general equations of motion of a continuous system 
the force per unit mass F occurs explicitly. From the formal 
point of view we can always, and in an infinite number of ways, 
consider F as the divergence of a suitable tensor; its components 
can then be supposed amalgamated with the 0 /y /s, so that we 
can at once put F == 0 in the equation (52). 

From the point of view of application this is not always con* 
venient, and in many cases the direct method is preferable; but 
from the speculative point of view this process of submerging 
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the force per unit mass in the stress is not only legitimate, but 
in accordance with the physical standpoint which refuses to 
admit action at a distance, asserting that every disturbance is 
transmitted by mediate action. In virtue of these considerations 
we shall put P = 0 in the vector equation (52), 

We now propose to transform, without altering their content, 
the three scalar equations included in (52) and the equation of 
continuity (53), in such a way as to replace these four equations 
by a set of four substantially identical equations. 1 

Referring to orthogonal Cartesian axes y, we project the 
equation (52) (in which we have now put P = 0) on the axis 
y t ; using (55), we get 



i oy k J i dy k 



(57) 



while the equation of continuity (53) or (53') takes the well 
known form 

%+i a « = 0. . . . . (58) 
dt i dy k 

Adding (58) multiplied by v t to (57) we get 

dt i dy k i dy k 

which can be written 

- sr* + ^ t- + **> = °- • • < 57 '> 

dt i dy k 

It will now be seen that the quantity on the left of (57') and 
(58) is in all four cases the sum of partial derivatives with respect 
to the independent variables t 9 y l9 y 2 , y 3 . It follows from § 5 that, 
since p denotes the material density, € = c 2 p can be interpreted 
as the energy density; further, it may be seen in a moment that 
the vector pv (the momentum density) represents the flux of 
matter (per unit of surface and of time), and therefore the flux 
of energy will be c*pv = cv. 



l CL particularly G. D. Ma'PTIOLI, Mend. Acc. Lined, Series V t Vol. XXIII 
(Becotid half-year, 1914), pp. 328-334, 427-432. 
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Now to give greater uniformity to the equations (58) and (57'), 
and to use in them the quantities whose physical interpretation 
has just been noted, we must replace t and the v/s by their 

Romerian expressions y 0 — ct, jS, = V -\ and put 

c 

T w = e = cV (59) 

T<x = - *Pi - r/w„ .... (60) 

J* = T w = 4>,, + ej8A = *a + • (61) 

(t, * = 1, 2, 3). 

The result is that the four equations (58) and (57') are all 
included in the single equation 

= o . . . . (62) 

by giving ? " m turn the values, 0, 1, 2, 3. 

From the equations (59), (GO), (61), we can see the interpre- 
tation of the various T's. T m represents the energy density; 
T Qi (i -~ 1 , 2, 3) the components with their sign changed of the 
relative Romerian flux; the T </f 's (i, h - - 1, 2, 3) in statical 
conditions (?>, --- 0), reduce to the ordinary stress components, 
from which they differ in general by the additive terms <r#/? A . 
= pv t v k (which, however, in ordinary circumstances are unim- 
portant compared with the other terms). To distinguish when 
necessary the T ik 's from the ordinary stress d> ifci we shall call 
them the kinetic stress. 

23. Einsteinian modification of the equations of motion of a 
continuous system in a particular case. 

The original equations (52) and (53), and therefore the equiva- 
lent set (62), are invariant when the axes of reference undergo 
an ordinary uniform translation. In the earlier stages of the 
argument we set out to give the dynamics of a material particle 
a form which should be invariant for a generic transformation 
(T 4 ), and we were induced to use Hamilton's principle in order 
to modify the equations of motion slightly. It followed from this 
operation that when there are no external forces the equations so 
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modified keep their algebraic form unaltered, not only for ordinary 
translations but also for the Lorentz transformations which we 
studied in detail in § 8. 

Now the dynamics of a continuous system must clearly 
include as a limiting case (corresponding to a medium of density 
everywhere zero except in one very small region) the mechanics 
of a single material particle. This at once shows that it is abso- 
lutely necessary that the postulates introduced for the mechanics 
of a continuous system should be brought into harmony with , 
the modifications accepted above in the mechanics of the material 
particle. The form of the equations (62), when there are no 
external forces, must therefore remain unchanged for any Lorentz 
transformation. If in accordance with (59), (60), and (61) we 
take for the T ik 'a the expressions 

Too = *> ?0i ™ ~ «A. = ePft t9 . (63) 

this condition is not rigorously satisfied, though, as we have just 
pointed out, there is in variance for ordinary translations; but 
it is easy to show that the required in variance for Lorentz trans- 
formations can be obtained by a modification, which, as usual, is 
very slight in the condition? ordinarily realized. 
To do this, we take the four-dimensional form 

<V = %o 2 - *W 
used above in discussing the dynamics of a particle, where as 

3 

usual d/ 0 2 — Hidt/i 2 . 
i 

Denoting by dy h (i = 0, I, 2, 3) the increments of the co- 
ordinates of the generic material element of the system under 
consideration, and by dl Q and ds 0 the corresponding elements 
of the (spacelike) trajectory and the world line, we have by 
definition 

A - (64) 

whence p = r* (64') 

and <V = %o 2 (1 - P)- ■ ■ • (64") 
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The parameters of the world line are 

ds 0 

(where we have suppressed the sign of absolute value, since in 
dealing with the motion of a material particle we must have 
jS 2 < 1, or dsy 2 > 0); they can be expressed in terms of the /?/s, 
using (64), (64'), and (64"), in the form 

A© = = _L 

ds 0 s/i-~fP' 



ds 0 VT— 



(i = 1,2,3). 
From these, taking account of the general formula 



and of the values of = g 0lk corresponding to ds 0 2 (cf. § 6, 
formula (12)), we get the moments 

X - Ji 
If we take the values of the monomials 

as given by these formulae, and compare them with the expressions; 
(63) for the T u '&, we see at once that the difference between 
each of them and the corresponding T ik is of the second order* 
We shall now show that if in the equations (62) we replace 
the values (63) of the 7\ 7 /s by the very slightly different values 

T* = *K K & * = 0, 1, 2, 3), . . (65) 



the equations will behave in the required manner for Lorentz 
transformations; and we shall be able to deduce the criterion. 
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to be applied for transforming the equations in the more general 
case. 

Note first that, if ds 2 is taken as the fundamental form, the 
terms on the right of (65), and therefore the constitute a 

covariant double system. Further, taking into account once 
more the particular values g% of the coefficients of ds 2 expressed 
in terms of the co-ordinates y, it will be clear that the covariant 
derivatives of the T^'s are identical with the ordinary derivatives, 
and that the terms on the left-hand side of (62) can be written in 
the form 

0 ?)T 3 3 

Icl9 — ^kl9 — ^k-iik* 

o oyi 0 0 

and are therefore identical with the covariant components of 
the divergence of the tensor T tk (cf. Chapter VI, p. 153). These 
equations therefore collectively express the fact that the diver- 
gence of this tensor vanishes — a property which is invariant for 
any transformations whatever of both the space and the time 
co-ordinates. Remembering finally that a Lorentz transformation 
leaves unchanged the form of ds 2 , we can now assert that the 
equations (62), with the values of T lk given by (65), will still 
hold after the application of any Lorentz transformation. 

Q. E. D. 

24. General case. Introduction of the energy tensor, and 
meaning of its components in general co-ordinates. 

When there are no stresses, the result we have arrived at is 
that we assign to the T lk a corresponding to the motion of a 
generic continuous system the tensor value given by 

T ik = eA, Aj., 

where e is the energy density and the A,'s are the moments of 
the world line of the material element. Further, given any 
distribution of stresses, referred to Cartesian co-ordinates, then 
in order to transform the equations of motion into any spacelike 
co-ordinates (leaving the time unchanged) we have traced out 
an argument based on the invariance of the bilinear form 

<t> = E^Otf. £ l i k (which showed us the three-dimensional tensor 
l 
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character of the <J>^'s when we pass to generic co-ordinates), 
on the vector character of the velocity, and on the invariance of 
the density. 

We now propose to consider more generally transformations 
(T 4 ) of space and time (i.e. of the set y 0> y l9 ;/ 2 , y 3 into a new set 
x & x v x 2> ^3)9 keeping the results already obtained in the two 
particular cases just referred to (cf. § 23 and § 20). A sufficient 
condition is that the T lk a (defined physically with reference to 
a particular system of co-ordinates) shall have the character of 
a tensor for any transformations whatever. The tensor so intro- 
duced is called the energy tensor. 

This is equivalent to asserting the invariance of a bilinear 
form in four variables 

B = S^^'* 

0 

having for its coefficients the quantities T ik and for its arguments 
the parameters 

ds 9 ds' 

of two arbitrary four-dimensional vcrsors l*, It will be seen 
at once that this postulate covers the two particular cases already 
discussed. In fact, when there are no stresses the tensor character 
of the T £k & follows from the expressions (65) adopted for them, 
while for transformations (T 3 ) which leave the timelike co- 
ordinate unchanged, the invariance of Ii involves that of the form 
O, as will be seen from the following argument. 

When we pass from a system x to a system x, it follows from 
the invariance of B that (with obvious meanings for the notation 
used) 

0 0 

In the case of a ( T 3 ), we shall have dx 0 = dx 0 , d'x 0 — d'x 0 , 
and therefore 

3 8 3 

T QO dx 0 d , x 0 +dx 0 Jl k T 0f 4 f x k +d f x 0 X k T 0k dx k +X a T^dx i d , x k 
1 1 1 

- T w dx 0 d f x 0 +dx 0 i k f 0 ,d f x k + i^Tadxrfx,; 
1 1 1 
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and as this must hold whatever the differentials dx 0 , d'x 0 may 
be, it follows that 

a s _ 

SfrTpfrda^ = Sjt 2V die*, 
1 i 

1 1 

As the differentials rf^, d'a^ are arbitrary, these relations 
express the fact that T w is an invariant (the energy density), 
that the T^'s are the components of a vector (the flux of the 
energy with its sense changed), and the 2Vs those of a 
covariant double tensor (the kinetic stress). 

Q. E. D. 

We now propose to examine, with reference to pseudo- 
Cartesian co-ordinates y, the physical significance of the 
form B when the directions Jj, are chosen in a particular 
way. 

Suppose first that both the directions are purely timelike, 
i.e. that 

dx t = d'x { = 0 (i = 1,2, 3) 

and therefore ds 2 = dy 0 2 , ds' 2 = d'y 0 2 . 

Then the only parameters which are not zero are £° and £'° 9 
which are equal to 1, and there remains 

B = T w \ 

i.e. in this case B represents the energy density. 

Now suppose that are purely spacelike, i.e. that 

dy 0 = d'y 0 = 0, and consequently 

ds 2 = - dl*, ds' 2 - dl' 2 . 
Then there remains 

3 



» y m dy t d'y k 

i al 0 al 0 



i.e. B reduces to the linear invariant of the kinetic stress. 
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Lastly, suppose that % is purely spaeelike and purely time" 
like; i.e. that dy 0 — 0, d'y t = 0 (i = 1, 2, 3), and therefore 

ds* = — dtf, ds'* = d'y 0 K 

Then B = S f r oi ^ 

i al 0 

and is therefore identical with the flux of the energy in the 
direction £ with its sign changed. 

We can now determine the physical significance of the T^s 
with reference to any system of co-ordinates whatever. This 
follows easily from the in variance of the form B if we allow 
that the physical significance of this form in the particular cases 
noted above remains the same in any other system of reference. 
The different cases are in detail: 

(a) The energy density at a generic instant and point x 0 , 
x l9 x 2 , x z will be what B becomes for purely timelike, i.e. for 

£o = £>o = 1 . § ^ = r = o (i> 0); 

i.e. it will be — 9°. 

9oo 

(b) The flux of the energy along a specified (spacelike) direction 

a of parameters a* = ~ will be what — B becomes when 
we put in it 

dx 

I dl l (i = 1, 2, 3), 

i.e. it will be 

1 s ^ 
— y 2^,0 a*. 

If in particular the direction a coincides with one of the co- 
ordinate directions, say x ft , we have 

1 
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and the other a* 's are zero; hence the flux of the energy in that 
direction is given by 

} T ho 

(c) The component in a direction a of the kinetic stress 
relative to a facet normal to a direction a' will be what B becomes 
when we put 

* ™ r*i ~ * ~ _ 

i.e. it will be 

3 

l 

If in particular the direction a coincides with that of one of 
the co-ordinate lines, say x n and a' with that of another, say x 89 
we shall have 

1 , 1 

v — y rr V — 17^ 

and all the other a's will be zero. Hence the component in the 
direction x r of the kinetic stress relative to a facet normal to x 8 
will be 

T 

Before concluding this section we wish to make one last 
remark. We have seen that when there are no stresses (the case 
of discrete particles of matter) the energy tensor takes the par- 
ticularly simple form 

T ik = € A, A, (65) 

Another important particular case is when the energy tensor 
has the form 

T lk := *A, A,. - pg iki .... (66) 

where p is any invariant function of the position and the time. 
In order to see the physical significance of this expression, con- 
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sider a specified point of V 4 , and take a system of co-ordinates 
which are, at least locally, pseudo-Cartesian, which we know 
is always possible. Then the g ik '& take the values S^, while if 
we make the direction x 0 coincide with that of the world line, the 
A/s all become zero, except Xq, which is 1. 
In these conditions we shall have 



^oo — 


e — 


P> 


Tuc = 


0 


(i 4= k). 


T u = 


■P 


(i > 0). 



The last two formulae tell us that on every facet there is 
exerted a stress normal to it and independent of its direction: 
the scalar quantity p measures the value of this stress per unit of 
surface. The medium under consideration therefore behaves like 
a perfect fluid (a fluid incapable of transmitting a shearing stress), 
and p represents its pressure. Tt is hardly necessary to point out 
that if p is negative it represents a uniform pull in all directions 
— which, within certain limits, is known to be a possible condition 
even in a real liquid. 

25. Relativistic form of the equations of motion of a con- 
tinuous system. 

In the particular case of no forces, we saw in § 23 how the 
general equations of motion of a continuous system can be put 
in the form 

2* Tj* = 0 (i = 0, 1, 2, 3), . . (67) 

o 

where the T 4 ,/s arc regarded as elements of a tensor, and that 
this equation holds in general co-ordinates x whatever may be 
the transformations (involving both space and time) imposed 
on the original co-ordinates y. The proof of this consists in the 
invariant character of the equations (67) (which express the 
vanishing of the divergence of the tensor T ik ), together with 
the fact that in the original co-ordinates y the equations (67) 
reduce tjp the form (62), and that the quantities T i1c become 
identical (neglecting terms of the second order, if not rigorously) 
with the expressions (63) which are their values in the classical 
mechanics. All this holds without change even if we drop the 
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particular hypothesis suggested by the law of transformation of 
the T ik 9 s when the transformations (T 4 ) are applied; that the 
forces are zero. It is only necessary to retain the tensorial character 
of the T lk '& in every case, as we have already agreed; which, in 
the particular case where stresses are present, means that 
their experimental values are determined, say, with reference 
to the co-ordinates which formed the starting-point of the 
investigation. 

The equations (67) thus hold so long as the metric considered 
is pseudo-Euclidean, and for any co-ordinates of reference what- 
ever. But the invariant expression for the laws of motion, which 
is seen to hold under this hypothesis, can be at once extended to 
the general case of any metric whatever, in virtue of the observa- 
tion made earlier in this book (cf. Chapter VI, p. 164) that in a 
first-order region every metric behaves as if it had constant co- 
efficients, and is therefore Euclidean in the proper sense in the 
case of a definite ds 2 , and pseudo-Euclidean in the cases which 
concern relativity mechanics (cf. §§ 6 and 12). In fact, the equa- 
tions (67) contain only contra variant derivatives of the T ik % or, 
in other words, combinations of their ordinary first derivatives 
with the g ik '& and their first derivatives; the argument thus does 
not go beyond tha consideration of a first-order region round the 
generic point which is being studied. 

26. A particular class of motions of a continuous system. 

In the classical mechanics the equation (52) of the motion 
of a continuous medium, when there are no forces and no mole- 
cular action (a discrete system), evidently reduces to 

f = 0, 

with which is to be associated the equation of continuity. It 
follows that the vector equation is satisfied at once by the uniform 
rectilinear motion of single particles, the density being then 
determined by the equation of continuity. This is conceptually 
evident; in order to translate it into a formula, we assign to any 
material particle, initially at P 0 , a velocity v(P 0 ) which is a 
function (a priori arbitrary) of the position P 0 : the geometrical 
equation of motion is then evidently 

P(t) = Po+ v(P 0 )*> • • - ■ (68) 
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which shows that the solution depends in substance on three 
arbitrary functions of three arguments each. 

If we wish to find an explicit expression for the law of variation 
of the density it is perhaps preferable to go back to the molecular 
equation of continuity instead of to the equation (5*3) which is 
its local form. It is a well-known result that if we introduce the 
functional determinant D of the actual co-ordinates y(t) with 
respect to the initial co-ordinates y 0 we get 

pD = p 09 

where p 0 is the initial value of p, and is a priori arbitrary just 
as is the initial distribution of the velocity. Projecting the 
equation (68) on the axes, and denoting the components of v by 
vdtfl Vl> y°s)> we get 

- Vi = ifi + vtt, 

whence 

It follows from this that D is a polynomial of the third degree 
in t, which reduces to unity for t — 0. Naturally (supposing that 
the i\ 9 s and their first derivatives are finite and continuous) the 
motion remains regular so long as JJ does not vanish; the smallest 
positive root (if such exists) of the equation of the third degree 
D = 0 determines the amplitude of the interval of regularity, 
&c. 

A particular case worth noting is when the density remains 
constant for each particle (incompressible systems). In this 

case ~ = 0, and the equation oi continuity, in the original 
at 

Eulerian form (53'), gives 

div(v) = 0 (69) 

This implies in particular that the divergence vanishes at the 
initial instant, and therefore gives as a necessary condition for 
the constancy of the density that the field of the initial velocities 
must be solenoidal, i.e. that divv(P 0 ) — 0. This condition is 
not however sufficient. In fact, if the density is to remain constant 
it is necessary and sufficient that D = 1 at any instant t; the 
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expansion of D as a polynomial of the third degree in t shows 
that this imposes three conditions, corresponding to the vanishing 
of the coefficients of t, t 2 , £ 3 , and (69) expresses only the first of 
these conditions. Further, if these conditions are satisfied initially, 

p remains constant for every particle (i.e. ^ = 0), which ensures 

that the equation (69) is satisfied at every instant, or in other 
words that the field of the velocities is always solenoidal. 1 

We have dealt at some length with this class of elementary 
solutions, because the results can easily be generalized for any 
F 4 whatever. If \(x 0 , x x , x 2 , x. A ) denote the moments of a generic 
congruence of lines in the V 4 , we know (cf. Chapter X, p. 274) 
that the necessary and sufficient condition tor the congruence 
to be geodesic is that the curvature vector, or, what is equivalent, 
its covariant components, shall vanish, i.e. that 

S^-i, A 1 =0 (i = 0, 1, 2, 3). . . (70) 

0 

We now propose to show that in a F 4 with any metric whatever 
we get solutions of the equations (67) by taking for world lines 
the lines of any geodesic congruence whatever, or, in other words, 
by supposing that the A's satisfy the equations (70) and by 
assigning a suitable value to the density p, and through it to the 
quantity e which appears in the expression (65) for the energy 
tensor of a discrete system (i.e. a syptem with no molecular action). 

Take the general equations (67), which we shall write in the 
form »j 

W**,, = 0 (i= 0,1,2,3) 

0 

and in them give T ik the value cA f A*. We shall have 

= A /f + *A t -| Z A* + cA £ A*|j 
and therefore by substitution 

\ i i k i9 U eiK + ehtf A*+ eKkW = 0 (i = 0, 1, 2, 3). 

0 0 0 

1 Cf. Gisotti : 41 Moti di un liquido che lasciano inalterata la distrihuzione 
locale delta prcssioni", in fiend, dclla R. Acc. dei Lineci, Series V, Vol. XIX (firHt 
half-year, 3910), pp. 378-376. The observation is there limited to the ease of 
permanent motion. 
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The second term vanishes in virtue of (70), and therefore the 
four equations reduce to the single condition 

3 s 

W^+eS^Ai* = 0. . . . (71) 

0 0 

If we choose e so that this condition is satisfied, the equa- 
tions of motion will all be satisfied also. 

Q. E. D. 

The equation (71) defining € can be put in a somewhat more 
expressive form by using the results (cf . Chapter X. p. 2(>7) that 

0 0 as 

where s denotes the arc of the world line, and noting that 

S,A]f — divX. 
0 

Hence (71) can be written 

^ € +€divX =0 .... (71') 

as 

which is precisely the form of the equation of continuity. 

If in particular we consider a solenoidal geodesic congruence 
(divX = 0), the last equation becomes 

P = °> 

as 

whence e = constant along any world line; i.e. the density of 
a particle remains constant throughout the motion. 

27. Experimental determination of the coefficients of an 
Einsteinian ds 2 . 

We shall close this chapter by some remarks of a general 
character on the experimental determination of the coefficients 

9ik- 

We suppose ourselves fixed in determinate physical conditions, 
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so that, as already noted in § 10 and § 12, we must also regard 
as determinate the Einsteinian 

8 

ds 2 = ^g^dXidXf, . . . . (72) 
o 

of the field which we wish to explore by means of suitable experi- 
ments. It is of course understood that we admit the validity of 
the fundamental postulates of general relativity, and more 
precisely: 

(a) {cf. § 16) the propagation of light always takes place in 
such a way that 

ds 2 = 0 (73) 

along every world line; 

(b) (cf. § 12) the world lines of the motion of a material particle 
in a field of force for which ds 2 can be expressed by (72) are 
timelike geodesies for this ds 2 . 

We propose to show that (a) suffices to determine the ratios 
of the coefficients g lJn or, which comes to the same thing, gives 
ds 2 except for a factor which can in turn be found from (6). Of 
the four parameters, x 0 will as usual denote the time, in the sense 
of the conventional time, measured at any single point by a clock 
which may be of any kind and even incorrect. However the 
timelike parameter x 0 is chosen, the mere fact that it is timelike 
implies, according to the Einsteinian theory, that ds 2 will always 
be greater than 0 if x 0 alone varies, x l9 x 2 , # 3 remaining constant. 
But, when dx 1 = dx 2 = dx$ = 0, ds 2 reduces to g in) dx 0 2 9 so 
that the coefficient g w necessarily > 0, and we can therefore put 

<7oo = W (74) 

where c is a positive constant (introduced for the sake of homo- 
geneity) and v, like g m , is an unknown function of x 0 , ct^, x 2 , x z 
(a pure number, i.e. of zero dimensions). 

We shall now choose any instant x 0 we please, and three 
values x ly x 2 , x% of the space co-ordinates, i.e. a point P; we 
propose in the first place to determine the ratios of the g 9 s at P 
and at the instant x 0 . 

For this we shall use light signals between P and very near 
points in the surrounding physical space, which is by hypothesis 
(at any given moment) in one-to-one correspondence with the 
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sets of three co-ordinates x v x& # 3 . In consequence, surfaces ind 
lines in this physical space represented by equations between 
x Xi # 2 , £3 at the moment # 0 are perfectly determinate: in particular 
the lines x 1 (given by the equations x^ = constant, x z — con- 
stant) on which only x 1 varies, the lines x z , &c. 

We shall choose two points Q and Q' very near P, on the same 
line x 1 as P. Suppose that Q and Q' correspond to increments 
(to be treated as infinitesimals) dx 1 and — • dx x of the co-ordinate 
x x ; dx^ dx$ are zero in both cases since the displacement is along 
a line x 1 . 

Suppose that two light rays start from P at the instant x 0 , 
one towards Q, the other towards Q'. Let x 0 + dx 0 be the 
instant when the first ray arrives at Q\ x 0 + d'x^ the instant 
(not in general the same as the first) when the second ray arrives 
at Q'. Using the expression (72) for ds 2 and the condition ds 2 — 0 
for the propagation of light, we shall have in passing from P to Q 

9oo dx o 2 H ~ 2 <7oi dx o dx i + 9x1**!* = 0. • (? 5 ) 

and in passing from P to Q' 

<7oo d ' x o 2 — 2 9oi d ' x o dx i + <7n dx i = 0. . (76) 

These two equations, in which dx^, dx 0 , d'x 0 are known (the 
first chosen as we please, the other two found by experiment), 

obviously give the ratios ^ 0 ~, It is to be noted that if the 

#00 ffoo 

elementary times of propagation dx 0 , d'x 0 (found by observation) 
are equal, then (75) and (76) give by subtraction g 01 = 0. 
Reciprocally, if g m ~~ 0, the two intervals of time must be equal. 
Hence the elementary propagation of light in the direction of a 
line x x is a reversible phenomenon if and only if g 01 = 0. 

In the same way, considering the other two co-ordinate lines 
x 2 and x 3 , we can determine the four ratios 

.9o2 ffz2. fl'oa #33 
> — > '9 • 

To obtain the other three ratios 



#23 #31 9l2 

9oo ffoo 9qo 
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we must make further experiments of ttie same type, but with 
the point Q in a direction other than those of the co-ordinate 
lines. 

Thus to determine - 28 , we can use a line on the surface through 
<7oo 

P, x 1 — constant, which is neither a? 2 nor x z \ e.g. 

#3 — #2 — constant. 

We then have, in passing from P to a very near point Q on this 
line, the increments 

with dx 2 arbitrary. 

If we make a light ray start from P at the instant x 0 towards 
this point Q 9 and if dx 0 denotes the small time of propagation, 
we get from (72) divided by g w 

dx 2 -J- 2 902 dx 0 dx 2 + 2 9 mx 0 dx z 
ffoo ffoo 

+ ° 22 dx 2 + 9 J* dx 2 + 2 923 dxf = 0, . (77) 
ffoo ffoo ffoo 

whence we got the ratio ~ 2 » , all the other quantities in this equation 

ffoo 

being known or already determined. In a similar way we can 

find g ™ and 9 ™. 
ffoo ffoo 

It is not inapposite to add that from other experiments of the 
same type we can get any number (in fact an infinite number) 
of further equations between the ratios of the gr's. The con- 
sistency of these results, in so far as this is borne out by the 
further experiments, affords a very significant control of the 
validity of the Einsteinian hypothesis so far as concerns the 
postulate (a). 

The ratios 

Si* = & (h k = 0, 1, 2, 3) . . (78) 
ffoo 

being thus determined, if we put 

<fos = g w ds'* = c 2 e 2v ds' 2 . . . (79) 
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(using (74)), it follows that the individual coefficients of the 
differential form , 2 

ds' 2 = _ 

are all known, and therefore the form itself is completely deter- 
mined. 

From (72) and (78), separating out the terms which contain 
the suffix 0, we get ds' 2 in the form 

3 3 
ds' 2 = dx 0 2 + g 0t dx 0 dx, + E A ^ dz 4 da^ . (80) 
1 1 

At this point we find that we have to determine the function 
v by gravitational experiments, and more precisely by experiments 
on the motion of material particles in the field in which the 
expression (79) holds for ds 2 . 

The equations of motion are included in the variational 
equation 

8[ds 0 (81) 

Now suppose that the time x 0 is taken as the independent 
variable along the trajectory. Let x £ (i = 1, 2, 3) denote the 

dx 

derivatives *; and using (80), put 

UXq 



ds' I 3 , 3 



dx. 



01 1 
= L (Xq J x 2 , x§ I Xi, x%, Xg). 



(82) 



Then, remembering (79), the variational equation (81) can be 
written in the form 

8f(e"L)dx 0 = 0 (81') 

This is equivalent to the three Lagrangian equations 

**%B->m-o (i=l,2,3). 
ax 0 dx £ ox i 

Noting that v does not depend on the x% and putting for the 
sake of brevity 

**-|£-a • • • < 83 > 

da^ a;p 0 vx^ - 
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It follows that. 

It is further to be noted that direct observation of the motion 
enables us to determine how the co-ordinates x i vary as functions 
of the time x 0 , so that we must consider the functions ^(a? 0 ), 
and therefore also the derivatives x t and x n known for every 
material particle left to itself in or projected into the field of force 
we are considering. It follows that the quantities a,-, p i defined 
by (83) are also known. Since 



dx 0 dx$ i 1 d x t 



it follows that ultimately the equations (84) are three linear 
equations in the four partial derivatives of the unknown function 
v. If we fix a generic point P and an instant x 0 , any arbitrary 
choice of the velocity of the body under experiment (i.e. of the 
three numerical values to be assigned to x l9 i a » sr 3 ) will give three 
equations in the four derivatives 

dv dv dv dv 
dx 0 * dx^ dx% 3.X3 

referred to the given position and time. The equations are there- 
fore more than sufficient to determine the numerical values of 
these derivatives, in the sense that by making a larger number 
of experiments we can not only determine the four unknowns, but 
also test the accuracy of the results as many times over as we 
wish. 

The derivatives of v> at every point in a certain field and at 
every instant in a certain interval, being known, v itself is 
determined except for an additive constant; hence, from (74), 
g w is known except for a constant multiplier, which we may 
suppose absorbed into the factor of homogeneity c 2 , so that c 2 
remains arbitrary. The presence of this constant in the expression 
for (fa), and hence, by (79), in ds 2 , seems to be in the nature of 
things, corresponding in substance to the choice, which remains 
arbitrary, of the unit chosen to measure ds 2 9 the space-time 
interval. 
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CHAPTER XII 

The Gravitational Equations and General Relativity 

1. Qualitative properties of the coefficients of ds 2 . 

It follows from the results in the preceding chapter (p. 325) 
that when the variables of reference y 0 , y^ y 2 , y z are such that 
they can be interpreted, without sensible error, the first as 
absolute time, and the others as Cartesian co-ordinates, then 
the coefficients g ik of the Einsteinian ds 2 of space-time, in con- 
ditions corresponding to the motion of the celestial bodies (in 
particular, of the bodies forming the planetary system), differ 
by very little from 8^, the difference being of at least the second 
order, in the sense explained above. More precisely we can say 
that: 

(a) The coefficient g w differs from 1 — — by terms of order 

c - 

higher than the second (cf. p. 320 in the preceding chapter), 
where U represents the ordinary Newtonian potential of the 
field considered. 

(b) The coefficients g 0£ (i > 0) are of order higher than the 
second. If in fact they were only of the second order, it follows 
from p. 339 in the preceding chapter that the difference between 
the velocities of propagation of light in the various directions 
round a point would also have to be of the second order; 
this, however, is physically inadmissible, as a difference of 
this magnitude could be detected by means of optical experi- 
ments. 

(c) The other coefficients g ik (i, k>0) differ from 8- k by 
terms of the second or higher order. 

Now let us consider the absolute motion of a generic material 
particle P, e.g. a small planet. Let u(P y P') be the Newtonian 
potential of the attraction exerted on it by any particle P' of 
the other attracting bodies, which we shall suppose to be of fairly 
large mass compared with P, as is in fact the case in the typical 
examples offered by astronomy. The disturbing effects of P on 
the motion of P' being supposed negligible, the dependence of 
u on the space co-ordinates y l9 y 2> y z involves the co-ordinates 
of P, while its dependence on the Romerian time y 0 involves 

(D655) tA 
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the co-ordinates of the attracting body P 9 . If SP is a generic 
displacement (of components S^) of the point P, and 

8u 2, — Sy £ 

is the corresponding increment of u 9 we have 

Su = P x SP, 

where P is the force exerted on P by P\ Further, if we consider 
a small interval of time dy 0 and denote by dP' the displacement 
of P' dtiring that interval, and by du the increment of u, we get 
similarly, applying the principle of reaction, 

du = — P X dP\ 

After this it is easy to determine the order of magnitude of 
the timelike derivative of u in relation to the spacelike derivatives. 
In fact, from the first formula, putting SP = nSZ (where n is 
the Versor of a generic direction) we get the well-known result 
that the derivative of u in this direction has the value P X n, 
and is therefore of the same order of magnitude as the intensity 
F of the force; while from the second formula, on dividing by 
dy 0 > it follows that 

dy 0 dy 0 c dt 

which shows that the order of magnitude of this derivative is 
that of /?P (with the usual meaning of fi). Hence, in the supposed 
conditions, the timelike derivative of u is of the first order in 
relation to the spacelike derivatives. The same result holds 

2U 

without change for cfo, which, as we have just said, is 1 

c 2 

(neglecting terms of order higher than the second), U being a 
sum of terms of the type just considered. 

Taking the case of g w as typical, we shall assume, in ordinary 
astronomical conditions, that: 

(d) The derivatives of the coefficients with respect to y Q 
are of higher order by at least one unit than the analogous deri- 
vatives with respect to the other y's. 
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We can sum up all this in the statement that if we are content 
with approximate results (meaning that we stop short at terms 
of the second order), everything happens as if the coefficients 
g i0 were zero, and the other ^'s independent of y 0 . This is 
equivalent to the statement that, to the given order of approxi- 
mation and in ordinary astronomical conditions, every ds 2 behaves 
as if it were statical (cf , Chapter XI, p. 326). 

2. The tensor G ik and its divergence. The gravitational tensor. 

We have already noted (cf. Chapter VII, p. 200) that for any 
V n whatever we can construct from the Biemannian tensor the 
symmetrical double tensor 

© 4 = S^o* (ij,hk), .... (1) 

1 

and its linear invariant 

G-S^G,* (2) 

1 

This definition naturally holds also for an indefinite metric: 
in particular therefore for the ds 2 of relativity (n =■ 4), in which 
case the tensor under discussion is called the Einstein tensor; 
its components are 

X jf ,g*(v,hh), .... (10 

0 

and its linear invariant therefore takes the form 

G = G ik = 2^9*9* (V, hJc). . . (SO 

o o 

We may note incidentally that for a V 2 the tensor G^ is 
related to the fundamental tensor g ik and to the Gaussian cur- 
vature by the formula 1 

G ik - - Kg ik (i, k = 1, 2);. . . (3) 

1 In fact, for n = 2, it follows from the definitions of K (p. 194, formula 
(26)) and of the €- systems (Chap. VI, p. 158) that ((/» hk) ~ Kc,j € hki as can 
at once be verified, rememberim* that the symbol (ij t hk) either reduces to 
(12, 12) =s Ka y or vanishes. Further, with the same definition of c, we have 
2 

also the identities 2 } h g> h e„ ^ = — g^. Replacing (ij, hk) in the formula of 
type (1') by JSTey *hk> and using this identity, we get (3). 
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while for a F 3 the quantities reduce to Ricci's symbols (cf . 
Chapter VII, p. 199) 

„« - (j+i »+ 2 » *+* k + ® 
a — , 

a 

the relation being 

G ik = - .... (4) 

where t^/ denotes the mean curvature of the V 3 , or in symbols 

«5kf = a* (5) 

1 

For n — 4, from the general formula 

_ n«(n«-l) 
12 

of Chapter VII, p. 182, it follows that in general the Riemann- 
Christoffel tensor has 20 algebraically independent components, 
while the elements G ik of the Einstein tensor provide only 10 
linear combinations. This simple arithmetical remark shows that 
the Einstein tensor cannot exhaust all the curvature properties 
of the V 4 , but, as we shall see, it does suffice to give those of 
essential physical importance. 

Before beginning the examination of this question, we shall 
find the expression for the divergence of the tensor G ik . From 
(1'), we have by covariant differentiation 

O ik[l - ij jh g* {ij, hk) b 

so that the components of the divergence (cf. Chapter VI, p. 153) 
Yi= hG%= h^G^ . . . (6) 

become 

Yi = X jh H9 jh 9* 1 <tf, hk) t . 

0 

In virtue of the relations 



hk) (hk, vj), 
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Biancbi's identities (formula (17'), Chap. VTI, p. 183) enable us 
to substitute 

for {ij y hk) h so that we have 

3 3 

0 0 

The first term is merely 6?,, as follows from covariant differen- 
tiation of (2') } which by interchanging the indices can be written 
in the form 

0 

Interchanging j and Z, and also h and k 9 in the second term, 
it becomes 

and in view of the identity 

(ji, Jch) = (i/, M) 
it obviously reduces to — Y £ . We therefore have 

= Wi (7) 

which in virtue of (6) can also be written 

ifctfj? - iff, = 0 (7') 

0 

Since the divergence of the tensor Gg^ (proportional to the 
fundamental tensor g ik ) is 

ktiGg*)^ 1 - hoXgikif 1 =-- G b 

0 0 0 

it will be seen that (7), or the equivalent equation (7'), expresses 
the property that the divergence of the tensor 

Gik — I Gg ik 

is zero. This tensor is called the gravitational tensor ; the name will 
be justified farther on. 
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3. Solidarity o! physical phenomena. Criteria for the con- 
struction of the gravitational equations, and reduction of the 
inductive proof of their validity to the statical case. 

In the immediate vicinity of a point and instant fixed in 
advance, a mechanical phenomenon is completely determined 
(at least conceptually) if we know, at the specified point and 
instant* the density and velocity of the matter (or, which comes 
to the same thing, of the energy), and the distribution of the 
specific stress, which includes as a differential consequence the 
determination of the external force; the latter, however, as 
already noted (p. 349) in the preceding chapter, can be supposed 
absorbed into the stresses, the concept of action at a distance 
being as before excluded. In substance, therefore, the local 
behaviour of a mechanical phenomenon is completely determined 
by the knowledge (which is both necessary and sufficient) of the 
energy tensor T iJr 

This remark has a more general scope, since it holds also 
for phenomena other than mechanical (e.g. electromagnetic 
phenomena). 

Einstein's fundamental view is that the aggregate of physical 
phenomena influences the metric of F 4 ; more precisely, that at 
every point P of the F 4 there must be a local relation between 
the value of the energy tensor, which may be taken as charac- 
teristic of the physical conditions, and the behaviour of the 
curvatures of the F 4 at the point. As an abstract hypothesis, 
the possibility of some such influence, limited however to the 
spatial metric, had already been suggested independently by 
Riemann and by Clifford, Einstein completed it, applying it 
not only to the spatial metric, but to the metric of the space- 
time which includes both space and time and also, as we saw 
in § 4, p. 291, and § 10, p. 320, when studying the motion of a 
material particle, the force in the field, which is represented 
through the coefficient g^. 

We have pointed out just above that from the mathematical 
point of view the external force can be considered as produced 
by a suitable distribution of stresses. From the point of view 
of the classical mechanics this principle could also be applied to 
the particular case of forces of gravitational origin; Einstein, 
however, assigns a privileged position to these forces, and 
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supposes that all actions of gravitational origin (and only these) 
are so intimately fused with the geometrical and temporal 
properties that they are directly determined by the four-dimen- 
sional ds*. Such a possibility is amply justified by the considera- 
tions set forth in the preceding chapter (pp. 291-328). All the 
other non-gravitational forces (in particular, actions of electro- 
magnetic origin), on the contrary, can be absorbed into the 
energy tensor. In order to put this view in a mathematical form, 
Einstein had to establish a relation between the ds 2 (i.e, its ten 
coefficients) and the energy tensor (i.e. the ten functions T ik ); 
he had therefore to determine ten equations. One of these was 
a necessary consequence, at least approximately, of the New- 
tonian theory. In the classical mechanics space is considered 
rigorously Euclidean, and by Newton's law the density p of the 
attracting matter determines the field of force by means of the 
Newtonian potential 



where /is the gravitation constant, and the meaning of the other 
symbols is as usual. From this expression for U Poisson's equa- 
tion 



follows in the ordinary way for every point of the field. Since 
the density p differs from the element of the energy tensor 
only by a constant multiplier (pp. 349, 354, §§ 22, 24, and 25), 
while to a first approximation (cf . § 4, p. 291 ) we have 



it follows that Poisson's equation establishes a relation between 
the fenergy tensor and a sum of second derivatives of g m . 

The differential equations expressing the relation between 
the coefficients of ds 2 and the quantities must therefore 
include this relation, at least to a first approximation. A reason- 
able induction suggests that in order to construct the ten required 
equations we must equate the ten components T ik of the energy 
tensor to ten differential expressions of the second order in the 
coefficients which, the system being invariant, must them- 




A 2 U = - irrfp 
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selves constitute a tensor. Now a double tensor of the second 
order is .given by those combinations of the Riemann-Christoffel 
tensor which we considered in the preceding section. Accord- 
ingly, the procedure which would first occur to one would be to 
assume that the G^'s were equal or proportional to the T tk '&; 
and this was in fact what at his first attempt Einstein did. But 
immediately afterwards he reflected that the fundamental equa- 
tions must not impose on the metric properties of space-time any 
a priori limitation, in this sense that any value whatever of ds 2 
must be capable of being regarded as theoretically possible pro- 
vided there is a suitable energy tensor. This property would be 
inconsistent with the condition that the G^s and T lJS /s are to 
be proportional, since the latter tensor, from its physical origin, 
satisfies four differential conditions expressing the vanishing of 
its divergence (cf. pp. 351, 359, §§ 23 and 25), so that the G>'s, 
would have to be connected by corresponding equations. The 
idea of a linear relation between the two tensors can however be 
retained without imposing any differential relation on the fl^'s, 
since the divergence of the tensor 

Gat — 1^9* 

is identically zero, as we saw in the preceding section. If in fact 
we put 

G tk -lGg ik - -icT* .... (8) 

where k denotes a constant (to be subsequently connected with 
the constant / in Poisson's equation), there will be no resulting 
differential relations between the g^'s. These are the celebrated 
gravitational equations. The foregoing considerations serve 
merely to give them plausibility from the purely formal point 
of view; their physical justification follows a posteriori from 
arguments of two kinds, which, we shall now explain. 

For the moment we consider only a first approximation; i.e. 
we suppose that ds 2 differs from the pseudo-Euclidean value by 
a small amount. As we saw in Chapter XI, p. 320, we may on 
this hypothesis assume 



g w 1 — 2y 

g oi = _ y . 



(i, k «= 1, 2, 3), 
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where the y 9 s are small quantities of the second order. We also 
saw that (still with the same hypothesis) the equations of motion 
of a material particle, to a first approximation, depend neither 
on the y/s nor on the y ik 's, but only on the coefficient g W9 or, 
which is the same thing, on the function y, and that they in fact 
reduce to the classical Newtonian equations 

4 = |£ a = i, 2, 3), 

since y = (9) 

c 2 

In view of this, the problem of justifying the gravitational 
equations to a first approximation reduces to that of proving: 

(a) that one of these equations (the one corresponding to 
i = k — 0) involves only y (i.e. U) and is identical with 
Poisson's equation; 

(6) that the other nine are consistent with values of the 
functions y of the assumed order of magnitude: their precise 
values in this first approximation are a matter of complete 
indifference, since whatever they may be we in any case arrive 
back at the Newtonian formula. 

We can therefore limit the scope of (a) and (b) to the statical 
case, for the reasons indicated at the end of § 1. 

The passage to a further approximation in the equations of 
motion of a material particle involves (cf. Chapter XI, p. 320) 
either the values to a first approximation of y { and y^ or the 
third-order correction 0 in the expression for g^. It is this 
difference from the results of the Newtonian laws which, being 
within the range of astronomical observation, provides a means 
of testing whether Einstein's hypothesis is or is not superior to 
its classical predecessor. 

At this point we are, so to speak, in conditions analogous 
to those in which Newton found himself when he substituted 
for Kepler's kinematical laws the dynamical principle of universal 
attraction, which was capable not only of including Kepler's laws 
as a first approximation, but also of predicting, and that on a 
magnificent scale, new facts which have since found marvellous 
confirmation. When the relativity theory is substituted for the 
' Newtonian, the phenomena predicted by it are much more 

(DG55) 13* 
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minute, but even with present experimental resources, some at 
least of them are within the reach of experiment. This experi- 
mental control provides the second line of argument alluded 
to above in support of the gravitational equations. 

4. General equations of Einsteinian statics. Empty space. 

When we are dealing with statical phenomena (cf. Chapter 
XI, p. 326), the ds 2 of space-time has the form 

ds* = V*dx 0 *-dl* . . . . (10) 

8 

with dl 2 = H ik a ac dx i dx k (10') 

i 

The coefficients a ik , like V, are to be functions of x 1> a^, a^, 
only; V is interpreted (cf. Chapter XI, p. 339) as the velocity of 
light, and is therefore considered essentially positive. 

With obvious meanings for the symbols, we have 

9ik—— ®ik> 9oi =0, #oo= V 2 > 9 = " <* V 2 , 

^ = 0, 9°°= -~ z , <*,*=1,2,8). 

We shall use a dash (') to denote Christoffers symbols and 
the components of the Riemann-Christoflel and Einstein tensors 
relative to the quaternary form (10), and shall keep the ordinary 
notation without a dash for the analogous symbols and com- 
ponents relative to (10'). 

From the definitions and (11) we get 

{ile, iy = {ik, Z}, 

{ik,0}' = {0i,k}' = {00,0}' = 0, 
{i0,0}' = £, 

{oo, %y = v r, 

where i, k, I, can take any of the values 1, 2, 3, F f = and 
V i — HjdVVj is the reciprocal system with respect to the 
-purely spatial cK 3 . 
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We shall next express Riemann's symbols of the second kind 
for the quaternary ds 2 in terms of the analogous symbols for 
<#* and of V. We have by definition, from formula (3) of Chapter 
VII, p. 175, 

- &, [{ih, ry {ik, iy - {ik, ry {ih, iyi 

0 

We shall examine separately the various cases which may 
occur, according to the number of the indices i, r, A, k which are 
zero. 

(1) No index zero. The first group of (12) gives immediately 

{ir, hk}' = {ir, hk} (13) 

(2) A single index zero. Riemann's symbols being anti- 
symmetrical with respect to the last two indices, we need only 
examine the three cases in which the zero index is i, r, or h. In 
each case, from the second group of the formula? (12) it follows 
immediately that Riemann's symbols of this type are all zero, or 

{Or, hk} f = {iO, hk}' = {ir, Ok}' = 0. . (14) 

(3) Two indices zero. From the general properties of the 
Riemann-Christoffel tensor the symbols of the type {ir, 00}' 
vanish identically (for any ds 2 ), and those of the type 

{00, hk}' = ijgJ<>(0j 9 hk) 
o 

vanish whenever g J0 = 0 (for j > 0), as in our case. There 
remain therefore to be considered the two types {Or, 0k}' and 
{iO, 0k}'. 

From (12) and the fundamental formula of covariant differen- 
tiation with respect to the purely spatial dl z we find 



{Or, 0k}' = 

{io, oky _ 



(14') 



(4) Three or four indices zero. It will be seen immediately 
from (12) that these symbols are all zero. 
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We are now in a position to evaluate explicitly the sym- 
metrical double tensor G^, the elements of which, .as we know 
(§ 2), are 

3 3 

G'ik = ?>h{ih, M}' = ^ h {ih, hk}'+ {iO, OA;}'. 
Introducing the analogous system 

= i h {ih 9 hk} 



relative to the ternary form dl 2 , we find at once, using the 
expressions obtained for the symbols {ir, hk}' 9 

G' ok = 0, \. • ■ (15) 

<&= -fs a fj = — VA Z V. 

1 

From these formulae and (11) we get for the linear invariant 
of the system 6r' fr , 

0 1 

= -2 A *J--i ik a*G ik (16) 

V i 

We have already seen (Chapter VII, p. 200) that for a three- 
dimensional manifold we can with advantage replace the tensor 
G ih by Ricci's tensor a ik9 the linear invariant 

i 

of which (cf. Chapter VII, p. 203) represents the mean curvature 
(the sum of the three principal curvatures). 

The G&& and a ik 9 s are connected by the linear relations 

®ik — *ik ~ £M a ik ; 

from this, multiplying by a ik and summing with respect to i 9 k, 
there follows in particular 

G = <M — 3<M = — 2<M. 
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Applying these results, (15) and (16) become 



fi'io = 0, (i, k = 1, 2, 3) 



(15') 



JG" = ^-^1, .... (16') 

which provide convenient expressions for the components of the 
Einstein tensor and its linear invariant in statical conditions. 

We can now return to the gravitational equations (8) of the 
preceding section. We note in the first place that since in statical 
conditions there is no energy flux, the components T oi vanish. 
Hence from (11) and (15') three of these equations reduce to 
pure identities, and there remain seven; six of these, corre- 
sponding to non-zero values of the indices, have the form 

+ -/ - ^f-a ik = - * T ik (t, k — 1,2,3) (17) 

in virtue of (15'), (16'), and (11), while the seventh, fori = h = 0, 

- VA 2 V - $G' goo = -kT w , 
or, from (15') and (16'), 

M= #c~™ (18) 

These seven equations 1 (17) and (18), as is naturally to be 
expected, reduce the Einsteinian statics to the three dimensions 
of the associated space. Their form is invariant with respect to 
the metric of this space, which has the dl 2 in question as its 
fundamental quadratic form. They also involve, in association 
with the fundamental form, the two invariant functions V and 
T^, and the covariant double system T ik (i, k = 1, 2, 3). The 

T 

latter characterizes the distribution of the stresses, while 

V 2 



1 Of. Lbvi-Civita: Rend, delta ft. Acc. dei Zincei, Vol. XXVI (first half -year, 
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is to be interpreted as the energy density (cf . Chapter XI, j>, 357), 
V representing the velocity of light, as was said at the outset* 

With regard to the energy density it is to be observed that 
no example of a negative density exists, 1 at least within the 
range of the better-known phenomena to-day, whether material, 
or electromagnetic in the broad senBe. Hence we .may assume 
that the right-hand side of (18) > 0, and we get the following 
geometrical corollary: The mean curvature tM> determined in 
physical space as the effect of purely statical phenomena, is in every 
case either positive or zero. 

An important consequence of the equations (17) is obtained 
on multiplying them by a* and summing with respect to the 
two indices. Using the definition of £M and (18) we get 

~f - *"( T + £)■ ' ■ • • < 19 > 

where ' T= S^Z 1 * (20) 

i 

and obviously represents the linear invariant of the system qf 
stresses with respect to our dl 2 (of the associated space). It may 
be remarked incidentally that this invariant must not be con- 
fused with the scalar invariant of the four-dimensional tensor, 
namely, 

o 

the value of which, from (11), is on the contrary 

T ^ 00 T 

V* 

Consider in particular a region of space in which all the 
components of the energy tensor vanish (empty space). From 
the physical point of view, this condition can be considered 
satisfied when the region in question contains neither ordinary 

1 In fact* if at a given point there is matter at rest distributed with density 
l, this implies an energy cPp of material origin, which in normal conditions 
enormously outweighs all other possible contributions to the total. Moreover, 
the electromagnetic contribution to the energy density also > 0. Hence even 
when there is no matter it does not seem possible for the energy density to have 
a negative value. 
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matter nor electromagnetic energy, since in this case it follows 
from the mechanics of material media that the stresses of material 
origin vanish, and, from Maxwell's theory, that the Maxwellian 
field of force vanishes, and therefore also the Maxwellian stress. 1 
With this hypothesis the equations (17), in view of (19), 
plainly reduce to the form 

A 2 F = 0, (21) 

a ik + = 0 (t, h = 1, 2, 3), . . (22) 

the first of which shows that not the timelike coefficient g w = V 2 
itself, but its square root, is a harmonic function. Also, (18) 
gives at once 

OI = 0 (21') 

If the energy tensor were zero throughout all space, it is 
intuitive from the physical standpoint that the Einsteiriian ds 2 
would be rigorously pseudo-Euclidean, and therefore the associated 
space rigorously Euclidean. This in fact represents the starting- 
point of Einstein's speculative construction, which assigns any 
deviation from a pseudo-Euclidean metric to those physical 
actions which are included in the energy tensor. Serini 2 too has 
given a rigorous proof of the hypothesis, based on equations 
(21) and (22). 

5. First approximation. Connexion with Poisson's equation. 3 

If we suppose that the expression (10) for ds 2 differs by very 
little from the Euclidean type referred to Cartesian space co- 
ordinates and Romerian time 

we can put (cf. § 3, and Chapter XI, § 10, p. 320) 

V = 1 - y (23) 

a a = 8* + y a (i, k = 1, 2, 3). . . (24) 

1 See e»g. Jkans: The Mathematical Theory of Electricity and Magnetism, fifth 
edition, 1925, Chap. VI, Cambridge University Press, 

* Rend. deUa R. Ace. dei Li?wei, Vol. XXVII (first half-year, 1918), p. 285. 
3 Levi-Givita, loco cit., and ibidem (fcecotid half-year, 1917), pp. 307-817. 
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We thus have 

dP = X ik ax dy t dy k = dl* + 2* 7ik dy i dy ki (24') 
i i 

where dl 0 2 is the line element of ordinary Euclidean space referred 
to Cartesian co-ordinates. 

The quantities y ik are pure numbers, like y, and the qualitative 
property we have assigned to d$* is equivalent, to a first approxi- 
mation, to treating all these seven quantities as infinitesimals. 

It follows that ChristofiePs symbols 



\dy h d Vi dyj 



are also infinitesimal. Since to the same order of approximation 
the quantities a jr keep their Euclidean values 8% it will be seen 
that the symbols of the second kind 

{ih,r} = Xjat'lihJ] 

do not differ appreciably from the homologous symbols [ih 9 r] 
of the first kind. It follows that from the definition of Riemann's 
symbols (p. 175, formula (3) ) we have, neglecting terms of 
higher order, 

= 4 c * r] -4 c * r] 

X d Vrdyh Sy ( dy k Sy r dy k dy.dyj' 

Hence it follows, from the definition of the G&8 (cf . § 4) and 
from (24), that 

G a = Z h {ih,hk} 

= is f 92 y«+ 82 r*» - 82 * » 8 v**l 

We now return to the statical equations (17) and (18). We 
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have already made them contain Ricci's symbols a ik instead of 
the G^a, the relation between the two being 

It is to be noted that 3ii is now to be considered infinitesimal, 
like the O lJe 9 s and their linear invariant, so that, from (18), T m 
is also infinitesimal. Replacing JUby its value (18), the explicit 
expressions which represent the a^'s to a first approximation 
take the form 



i A W dyjy* fad* tyidy'J 00 ' 

(i, k ~ 1,2, 3) 



(25) 



Using this result, and noting further that, neglecting infini- 
tesimals of higher order, the co variant derivatives of V =-=* 1 — y 
do not differ from the Ordinary derivatives, so that in particular 

at 

i dy? 

we find that (17) and (19) can be written as 

a* - (yk + AJlyS? — — icTf,,, . . (26) 
ASy = -l- K (T+T w ), . . . (27) 

where the symbols {y) ik denote covariant derivatives of y; 
and in particular, in empty space, since the terms on the right 
vanish, they become 

«* = (y)a. (26 7 ) 

ASy = 0, (27') 

which to a first approximation, as is naturally to be expected, 
are identical with (22) and (21). 

At this point we must consider the mechanical significance 
of the function y, or, better, of the product c 2 y. On p. 293, Chap. 
XI, when dealing with Einstein's modification of Hamilton's 
principle, we saw that, when ds 2 is very close to the pseudo- 
Euclidean form, the difference between g w and unity is — 2 ~[ 
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to a first approximation, V being the potential of the field of 
force in which the motion takes place. In the present case this 
difference is — 2y, bo that we have 

»-£ » 

This conclusion could of course also have been deduced 
from the general proposition in § 12 of Chapter XI, p. 328, that 
— £ c 2 V 2 (together with a non-essential additive constant) con- 
stitutes the potential function of the force exerted in the field 
in statical conditions. In our case V 2 — 1 — 2 y, and therefore 

_ \<*V 2 = -**(l-2y) = -| 2 +cV, 

which proves the required result. 

Now let us for the moment again take the standpoint of the 
classical mechanics, and consider the field of force due to a generic 

distribution of matter of density p = 5 , where € is the corre- 

c 

sponding energy density. If U is the Newtonian potential of 
this field, we know that Poisson's equation 

A 2 l/= -infp- 

holds, / being the coefficient of universal attraction. If on the 
other hand we take the standpoint of general relativity, the 

same distribution of matter gives a ds 2 for which y — — , and 

an energy tensor whose component T m coincides with e, while 
the components T 0t - vanish in statical conditions, so that the 
remaining components T tt represent stresses (cf. Chapter XI, 
p. 358). If we are dealing with discrete matter, the components 
T iky and therefore also their invariant are zero, and (27) 
becomes 

In order that this may be identical with Poisson*s equation, 
it is necessary and sufficient that the constant k of the gravi- 
tational equations and the universal constants / = 6*7 X 10"" 8 
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and c == 3 X 10 10 (in C.G.S. unitB) of the classical mechanics 
should be connected by the relation 

« = m 

which gives in round numbers (C.G.S. units) 

k = 2 X IO- 48 . 

For the remainder of the argument we shall adopt this value 
of k, and shall definitely take up the standpoint of relativity. 
In relation to the remarks in § 3 we can at this point consider that 
the preliminary justification of the gravitational equations is 
terminated. In fact, their first approximation is represented in 
statical conditions by (26) and (27). The equation (27), as we 
have now proved, is identical with Poisson's equation; the 
equations (20), as we shall see in the following section, serve to 
determine the quantities y ik9 which to a first approximation, as 
we have already said, do not influence the motion, but will 
become essential when we come to discriminate on a more refined 
scale between the Newtonian mechanics and the relativity theory. 
Here we have referred specifically to the statical case, but the 
justification of the gravitational equations obtained in this case 
also holds good, as already pointed out in § 3, in the general 
case, provided the coefficients y t of the product terms in dx 0 dx { 
(i = 1, 2, 3) are of order higher than the first. We have arrived 
at this condition by a process of induction from experimental 
facts, and have used it to reduce the ten gravitational equations 
to the seven of (17) and (18). We are now so to speak at the 
deductive stage, and must first show that the gravitational 
equations contain in synthesis all the facts to a first approxi- 
mation; and at this stage we must point out that in ordinary 
conditions of material motion (i.e. with velocities which are small 
Compared with that of light) the three gravitational equations 

<?« - I $9* = -« T* (i - 1, 2, 3), . (29) 

which are rigorously true in statical conditions, continue to hold 
to a first approximation if we suppose the quantities y t of order 
higher than the second (that of y and of the y^'s). In fact, the 
left-hand side of these equations, as we have already seen (cf. § 4), 
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becomes identically zero when we put g oi ~ 0: this means that 
if the three g {U 9 & are treated as quantities y i of a certain order 
of smallness, the left-hand side of (29) will be of at least the same 
order. 1 If therefore we suppose that the y/s are of order higher 
than the second, the left-hand side of (29) will also be of the 
same order, and therefore zero to a first approximation. As 
regards the right-hand side, we know (cf . Chapter XI, p. 356) that 
in a pseudo-Euclidean metric, and therefore (neglecting terms 
of higher order) also in the case we are considering, 

Tot = — € ft = — Taofii'y 

and hence, from the presence of the factor it follows that T oi 
is of higher order of smallness than T iH}9 and therefore that the 
right-hand side of (29) is of higher order than — k and is 
therefore zero to a first approximation. Hence, in these con- 
ditions, the equation (29) is satisfied. 

6. The Einsteinian ds 2 which corresponds to a first approxi- 
mation to an assigned Newtonian field. 

Suppose a Newtonian field and its potential U given. From 
the remark made in § 1, we can ignore the possibility (consequent 
on the motion of the material masses) that U may depend 
explicitly on the time, and treat U only as a function of the space 
co-ordinates, as if the masses were at rest in the positions they 
occupy at the instant considered. Consider a region not occupied 
by attracting masses, in which region A 2 V = 0. In order to 
characterize the corresponding Einsteinian ds 2 to a first approxi- 
mation we have to determine (cf. § 5) the functions y and y ik9 

where y is given by y — and is therefore harmonic (ie. a 

solution of (27')), and the y ik 's have to satisfy (26'), which can 
also be written in the simpler form 

a* = 3^- (26") 

1 The quickest way of showing this is to suppose that the g^a are of the form 
h 7*, where A is a numerical coefficient determining the order of magnitude, and 
the 7*'s are functions of position and of the time, to Lks treated as finite quantities 
together with their first derivatives, It is clear in this case that the left-hand 
side of (29) contains h as a factor. 
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since the covariant derivatives which would occur on the right 
would differ from the ordinary derivatives by terms of higher order, 
and can therefore be replaced by these ordinary derivatives. 

For the integration of these equations we note in the first 
place that we get a particular solution by taking 

Yik = 28fy (30) 

The proof follows immediately from the expression (25) for 
the a^'s, in which T w is of course put equal to zero. Substituting 
for in (26") the values 

is fi*v* + d2 Y'* _ l 2 y«L _ d ly f ±\ i . (29') 

: h [Sy h 2 Sy { dy k dy h dy k dy{dy h Y 

and remembering that y is harmonic, the required result follows. 

Since then the equations (26") constitute a linear non-homo- 
geneous system in the y lk \ the general integral is obtained by 
adding the solution (30) to the most general solution of the 
equations with the right-hand side zero, i.e. 

a* = 0. 

The general integral of this system could easily be constructed 
by using the result (cf. Chapter VII, p. 200) that for a three- 
dimensional manifold the vanishing of Ricci's symbols v a 4fr implies 
that all Biemann's symbols are likewise zero, or in other words 
that the quantities 

a* = &l + Y* 

are the coefficients of a Euclidean dl 2 (referred to any curvilinear 
co-ordinates whatever). But, as it happens, the addition to the 
particular solution (30) of the general integral of the homogeneous 
system has no interest, since, as we shall see shortly, this corre- 
sponds merely to a change of the co-ordinates of reference. 

In fact, the vanishing of the symbols a^, as we have just 
pointed out, expresses the necessary and sufficient condition that 
dP should be Euclidean, i.e. reducible, with a suitable choice of 

3 

parameters, to the form Z t - dyf. Hence, if a^, z 2 , x z denote the 

l 

co-ordinates of reference in their most general form, the most 
general method of defining a Euclidean dl 2 > with respect to these 
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co-ordinates x, will evidently be to introduce a transformation 
of any kind 

Vi = ft (*if *2> ^) <* = 1> 2, 3) 
between the y 9 & and the x\ and to take for the coefficients a ik 

3 

those which result from expressing S t dy* in terms of the differen- 
tials of the x'&. 1 

Assuming the functions y t (a^, x^ x 3 ) in the form 

as is always legitimate, and inserting the corresponding differen- 

tials in the trinomial S, rfjy, 2 , we get 
i 

s 

tfZ 2 ■= E lk a lk dx t dx k 
i 

' \dx k Ox J ! J dx t dx k 

In order to take account of the condition that the difference 
a tk — 8f — Yik is limited to the first order, together with the 
further condition that the difference between the Cartesian 
co-ordinate system of the t/'s and the curvilinear system of the 
x's is to be of the same older, 1 it is sufficient (and necessary) that 
wc should be able to treat the functions f and their derivatives 
as infinitesimals. Tt follows that 

< w > 

which constitutes the formal expression for the general integral 
of the homogeneous system a ik = 0 (the a^'s, in the form (29'), 
being linearly dependent on the y lA 's). 

1 Tf tins condition is not imposed, the only necessary condition is that the six 
numerical qxiantities 

ytk \d* k + ai J + V d*i dx' k 

should be infinitesimal, and this can be secured, a*» Prof. ALMAN8I has shown 
(cf . " JVordinana teoria dell' elasticita e la teoria delle deformazioni finite in 
Rend, ddla R. Acc, dn Limei, Vol. XXVI (second half year, 19X7), pp. 3-8), 
even when the quantities £ are not themselves infinitesimal, 
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But it is not this formal expression with which we are con- 
cerned, but rather the circumstance that the term (31), which is 
to be added to (30) in order to get the general integral of the 
system (26") with the right-hand side not zero, can always be 
made equal to zero by a suitable change of co-ordinates; this 
change being the substitution for the x 9 a of the combinations 

Vi = s*+ €i(*i* *2> *z)> • • - (32) 

the result of which is that the expression for dl 2 reduces, by 
a 

construction, to S £ dyf, all the differences — 8* vanishing. 
1 

When the y's are chosen as variables, the transformation (32) 
must naturally be applied also to the particular solution (30). 
But since the £'s are to be considered infinitesimal equally with 
y, (32) reduces, so far as (30) is concerned, to the mere substitu- 
tion of the y 9 s for the re's. The expression (30) for the particular 
solution, which alone is of any interest for our purpose, thus 
remains unaltered when the system of reference is changed to 
the y's. 

It is further to be noted that the elementary form L%y (the 
sum of the second derivatives) of the parameter also remains 
unaltered. 

From the foregoing arguments we see that in an empty field 
the statical potential U (Newtonian to a first approximation) is 
associated with a metric modification of ike associated three-dimen- 
sional s<pace. With a suitable choice of the co-ordinates of reference 
(the ^'s just defined) we have 

U 

with y harmonic (in the y's as well as in the #'s); the values of 
the coefficients a ih of the square of the line element are given 
(to the same degree of approximation) by the expression 
8j(l + 2y), so that 

dP = (1 + 2y) (dx x * + dx* + dxf). 

It will be seen that in general the space does not remain 
Euclidean even to a first approximation, but, to this degree of 
approximation, can only be conformally represented in a 
Euclidean space. 
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To gram up, remembering that g w — 1 — 2y and that g^ = 0, 
the ds 2 of the Einsteinian space-time belonging to an assigned 
Newtonian field of force with potential U = c*y is given by 

ds 2 = (1 — 2y) dx 2 — (1 + 2y) dl 0 2 , . . (33) 

where dl 0 is the line element of a Euclidean space. 

In the case of a single point-mass m 0 we need of course only 
take 

fm 0 I 
y ~ ~# r 9 

where r represents the distance between the mass and the point 
at which the attraction acts. 



7. Further approximation for the coefficient g m — V 2 in 
statical conditions. 

In the preceding chapter (p. 320) we saw that if the ds z of space- 
time is not far removed from being pseudo-Euclidean, then the 
motion of a material particle is affected only to a first approxi- 

2U 

mation by the second-order difference 2y = — between g w and 

c 2 

unity, so that the results are the same as for the Newtonian theory. 
If, however we wish to proceed to a further approximation, i.e. 
to calculate the principal part of the Einsteinian correction to be 
applied to the laws of the classical mechanics, we must not only 
find the second-order quantities y ik , which are the differences 
between the a, 7 /s and the Euclidean values (the y/s being of 
higher order), but we shall also need an evaluation of g w = V 2 
carried to the fourth order. 

This is easily found if we limit the investigation to the statical 
case and to a portion of empty space (with the energy tensor 
zero). The differential equation (21) of § 4 is then rigorously 
true, i.e. 

A 2 V = 0, (21") 

it being of course understood that A 2 refers to the spacelike cB 2 . 
To a first approximation, as has already been seen, we have 



V 2 = 1 — 2y, 
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where "y is proportional to the potential of the field, by (9). We 
shall accordingly have to put 

V = l — y—fa . . . . (34) 

where 0 is to be of higher order than the .second. The explicit 
expression of A 2 in generic co-ordinates (cf. Chapter VI, p. 154) 

gives in the first place, by (21 ") and (34), 

2* ^ Watf) = - k 1 (VaV). . (21-) 

From this we have to find ip to a fourth-order approxima- 
tion. As y is already of the second order, in calculating y l 

^from y t = = and \/ a we need only consider terms as far as 

the second order, i.e. we can use the form (cf. formula (33)) 

dl* = (i + 2y)dl 0 2 . 

This gives 

J a =- (l + 2y)>\ 
; _ y* 
7 1 + 2y 

whence, neglecting terms of higher order than the fourth, 

Jay 1 = 7l {\ + y) =. y,+ *(/),. 

-4 priori we do not yet know the order (by hypothesis cer- 
tainly higher than the second) of the additional term 0, which 
we have to calculate not only as far as its principal part of order 
v, but also so as to include additional terms, if any, up to the 
fourth order inclusive. For the moment we shall consider the 
part of order v. On the left-hand side of (21' ") we can substitute 
1 for %/ a, the difference between these two quantities being of 
the second order, which is equivalent to neglecting terms of order 
v + 2. As y is harmonic, it follows that 



A2<* f- iy 2 ) - 0 
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where A° as before represents Laplace's operator Ej s , whence 

1 ox? 

iff — — ly 2 + a harmonic function. 

With suitable hypotheses as to qualitative behaviour, it will 
be seen that the additional harmonic function must vanish, and 
there remains 

f ^ - Jy2 

as the principal term of the function t/s. As this is already of the 
fourth order, we can take — £y 2 as the expression for ijs correct 
to the fourth order inclusive. 

To the same order of approximation we get 

#oo -= V 2 = (l-y f £y 2 ) 2 

= 1 - 2y + 2/ (35) 

8. A theorem of mechanical equivalence. 1 

From the two preceding sections it follows that to a sufficient 
degree of approximation the Einsteinian ds* which corresponds 
to a statical Newtonian field of potential U, fixed in advance, 
is given by 

ds 2 = (1 - 20cfy o »- (I + 2y)rfV, . . (36) 

where y - ^ . (37) 

c 2 

^ = y-y 2 (37') 

(cf. formula (35) in the preceding section). 

In (36) we are satisfied with the first approximation for the 
coefficients of the spacelike dl 0 2 , while for V 2 the part which is 
of the fourth order is also given. This formula is a particular 
case of the ds 2 considered on p. 320 of Chapter XI (formula (27)). 
In order to define the motion of a material particle, i.e. the 
geodesies of a space-time of this kind, in accordance with the 
criteria of § 10, p. 320, we note first of all that (36) gives us 



l Cf. LRri-CtviTA, Bend. Acc. Lincei, Series VI t Vo). IV, 1926, pp. 3-5. 
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comparing this with the equations (28) on p. 321, and noting 
~^~\ is identical with jS 2 - ^ we see that it corresponds 

to the particular case in which the linear form T x vanishes and 
the quadratic form T 2 reduces to yjS 2 . This brings us back to 
the case considered in § 11, p. 323, the necessary values for 
the symbols then used being 

o 2U i * V* 



Equation (31) on p. 325 gives 

IP 



(38) 



which leads to the following theorem: The trajectories of the 
Einsteinian motion coincide to a second approximation with 
those of a Newtonian motion in ordinary Euclidean space for 
which the total energy is still E and the force is derived from the 
potential JJ V 

If ^ is the ordinary time in this auxiliary Newtonian problem, 
the corresponding integral of ms mva is 



This integral can be put in a more convenient form for the 
purpose we have in view. From equation (31) on p. 325, 
neglecting terms of higher order, V 1 + E, which we shall call 
Z7*, can be written in the form 

u* = (u+cy + E) (i + *v +x y 

whatever may be the values of tfs and x- I* 1 our case, since x = > 
V 2 

\ft = — ^ , we shall have 



Further, we saw in the section referred to that for the 
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Einsteinian motion, to the assigned degree of approximation, 
there exists the integral 

i^ 1 + ~% + x)-(U+ <ty) - E. 

If t is the variable which acts as the time in this problem, 
v 2 = (^jj^ • Substituting for x an d ^ their values, we can write 

From this and from (39) we get the differential relation 
dt = A, (l + 4 £) ; 

when the Newtonian problem is completely solved, this relation 
enables us to find also the law of the time in the Einsteinian motion. 

9. Motion of the planets according to Einstein, to a second 
approximation. Displacement of perihelion. 

The most striking application of the foregoing result is to the 
problem of the motion of the planets round the sun. If we treat 
the planets (as is in fact usually done as a first approximation) 
as material particles with mass so small compared with the sun 
that they do not perceptibly affect the field (or more generally 
the four-dimensional metric associated with the field), then dur 
problem is essentially that solved in the preceding section, for 
the particular case in which the function U is the potential of 
a single mass m 0 (the snn) which can be taken as the origin O 
of the co-ordinates. We have therefore, as in § 6, 

u = 

r 

where r is the distance between the sun and the planet, measured 
as if the space between the two were rigorously Euclidean. We 
know from the preceding section that as regards the trajectory 
everything happens as if the ordinary mechanics held and the 
planet were acted on by a unitary central force derived from 
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the potential (38). This consists of two terms, the first of which, 

^1 + corresponds to an attraction inversely proportional 

to r 2 , of radial component 

/ 4E\ dU k 
\ (?) At ' r*' 

where for brevity we have put 

k = i 1 + (40) 

and the second to a disturbing force, also central, but inversely- 
proportional to r 3 , of radial component 

3 aii* = _ k L 
c 2 dr r 3> 



where 



k t ^ 6 {^^°y (41) 



There are thus two modifications of the Newtonian law: 
(!) a change in the coefficient of proportionality, which becomes 

4rE\ . 

1 + ~-- \ instead of fm 0 ; (2) a disturbing force (of the second 

order relative to the Newtonian force) inversely proportional to 
the cube of the distance, and therefore of the type already con- 
sidered by Newton. Now it is known from the theory of central 
forces 1 that for motion in a plane under a force whose radial 
component is 

_ * _ h 

the equation of the orbit in polar co-ordinates r, 6 can be put in 
the form 

r = —-?. .... (42) 

1 + e cosat? 

1 See e.g. Lkvt-Civita and Amaliu : Lezioni di Meccanica Rationale, Vol. II, 
p. 200 (Bologna, Zanichelli, 1926); or Lamb: Dynamics, second edition, Chap. 
XI, § 91 (Cambridge University Press, 1923). 
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by a suitable choice of the direction of the polar axis, twhere, 
G being the area-constant, 



6 s ' p 

and e is a constant of integration, which can always be supposed 
positive, 6 being if necessary replaced by 9 + rr. 

All this holds generally. Now suppose in particular that 
e < 1, denoting elliptic motion as a first approximation (Le. for 
^ = 0, so that a reduces to unity). We can also suppose e > 0, 
which means that we exclude the case of the circular orbit. 
With this limitation, 8 in equation (42) can be made to vary 
without restriction, and the equation shows that when 0 increases 

by — <, r again takes the same value. This holds in particular for 
a 

the minimum value of r (i.e. perihelion); and therefore, for two 
successive passages through perihelion, the anomalies differ by 

— In the particular case a = 1 (elliptic orbit with fixed peri- 
a 

helion), the value of this difference is precisely 27r, so that the 
difference ^ n \ 

a = — 2rr — 2tt I - — 1 ) 
a \a / 

represents, in magnitude and sign, the angular displacement of 
perihelion in one revolution. With the value of a given above, 
taking into account the smallness of we have 

h 

Since for a, which is already a correction, we need only a first 
approximation, we can take for G 2 its Newtonian value 1 
6?a = fm 0 p = f?n 0 a(l — e% 

where a and e denote respectively the semi-major axis and the 
eccentricity of the orbit. Using the value (41) of A^, we get for 
the displacement of perihelion the expression 

—as- ■■•_•«"» 

which was first calculated by Einstein. 

1 Cf. Lbvi-Civita and Amaldi: op. eit., p. 212. 
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In order tb adapt the formula to numerical calculation for 
any planet, we introduce the mean radius a 0 of the earth's orbit, 
and write (43) in the form 

1 — e 2 a 0 c? a * 

The eccentricity of any planetary orbit being small, we can 
at once put e 2 = 0. The radius of the orbit being a, we know 
that the velocity v in the orbit is given by 

v* _ fm 0 
a a* ' 

which expresses the equality of the attraction and the centripetal 
acceleration. For the earth we have in particular 

v o — — > 
a 0 

and accordingly (43) becomes 

"= fe 3f < 4 »'» 

The velocity v 0 of the earth in its orbit being practically 
30 km. per second, and c being 300,000 km. per second, we have 

approximately -° — 10 ' 4 , and therefore 

c 

a = 6tt.10- 8 % 
a 

For Mercury, the planet nearest the sun, and therefore 
evidently showing the most perceptible effect, — = 0-39, which 

gives for a a little more than one-tenth of a second. Since Mercury 
completes about 420 revolutions in a century, we thus find for 
the perihelion of its orbit the centennial displacement of 42", 
which corresponds exactly to the difference between the total 
observed displacement and the amount predicted by ordinary 
celestial mechanics from the Newtonian theory of the per- 
turbations due to the other planets. It was precisely this 
residual shift of about 42" per century which before the 
birth of the relativity theory could only be explained by 
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introducing hypothetical disturbing forces with constants deter- 
mined ad hoc. 

For the other planets, the corresponding calculation naturally 
gives a much smaller centennial shift, hardly 8*6" for Venus, 
3-8" for the earth, 1-35" tor Mars, and still less for the others, 
and the results of observation which are at present available are 
not accurate enough to provide any basis of comparison with 
these figures, 

10* Displacement of the spectral lines. Deflection of light. 

In this section we propose to examine the effect of a field 
of foBee on the frequency and the path of light rays. We suppose, 
as in the preceding section, that the field is statical, with a New- 
tonian potential Z7, and we consider regions of the field external 
to the attracting masses. The effect to a first approximation will 
be sufficient for our purpose, and we can consequently assume 
that the expression (33) of § 6 

ds* =r- (1 - 2y)dx 0 2 - (1 + 2y)dZ 0 2 , . . (33) 

where y stands for ~ , holds for the four-dimensional ds 2 . 

Now suppose that a phenomenon which is predominantly 
timelike (e.g. the vibration of an atom) takes place at a specified 
point T. If dt is an elementary interval of time in which this 
phenomenon is considered, and if within this interval the varia- 
tions dy t of the space co-ordinates are assumed to be negligible, 
we shall have from (33) (since cr 0 — ct) 

ds% =- (1- 2y 7 .)c 2 <fc 2 r , 

where the suffix T denotes that the values in question are 
those belonging to the phenomenon at the point T. If the 
phenomenon takes place instead at another point S we have 
analogously 

dsl = (l-2y s )<*dt% 

Now suppose that we have two identically similar phenomena 
at different points, e.g. the emission of light from two atoms 
chemically alike and in identical physical conditions. If we 
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admit that in such a case the space-time interval da? will be the 
same for both, the foregoing formulae will give 

This differential relation between corresponding times of the 

two phenomena under discussion, expressing the constancy of 

dt ^ 
the ratio naturally implies that the same ratio exists between 
dt T 

any finite pair whatever of corresponding intervals, A^, and At T ; 
in particular, if the phenomenon considered is periodic, between 
the respective periods or between the reciprocals of the fre- 
quencies v s and v r We thus have, neglecting terms of the second 
order, 

v ^zjy = y T - Ys = l <u T - u s ), 

which shows that in a gravitational field the variation of the 
frequency is of sign opposite to that of the potential; hence, in 
particular, there will be a reduction of the frequency for a given 
spectral line (and therefore a shift of the line towards the red end 
of the spectrum) on passing to a region of higher potential. 

By way of example, let us compare two monochromatic light 
rays emitted in the same conditions on the earth T and on the 
sun S. We can neglect U r in comparison with U s and take for 
U s (cf . the preceding section) the value 

77 _ fm Q fm 0 a 0 

where r 0 denotes the sun's radius. As we saw in the preceding 
section, we now have 



a, 



0 y 



0 



the (relative) variation of the frequency, if Av — v T — v s , is 
therefore given by 

U s = A* = V «o (44) 

c 2 v c 2 r 0 

CD 655) 
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and since in round numbers 



V ° = 10 - 4 , °° = 200, 
c r 0 

we & *1 = 2 X 10-- 

It was uncertain for some years whether there did in fact 
exist a shift of this kind towards the red for the solar rays, as 
compared with corresponding rays emitted from a source on the 
earth. The most recent measurements by Perot, Fabry, and St. 
John tend to confirm its existence. 

A more remarkable verification has recently been provided 
by St. John, who, following up a suggestion of Eddington's, has 
observed analogous displacements in the spectrum of the Com- 
panion of Sirius. 

We now pass to the consideration of the path of a light ray 
in a field of force. Along any ray we shall have in the first place 
(cf. Chapter XI, p. 336) ds 2 = 0, and further, the field being 
statical (Chapter XI, p. 340), Format's principle 

Sjdx 0 — 0 

will also hold. 

Since ds 2 = 0, the expression (33) for ds 2 gives 

dx 2 — 1 + 2y dl 2 
uj}q — ^ ai 0 , 

and therefore, neglecting squares of y 9 

dx 0 --= (1 + 2y) dl 0 . 
The rays are therefore defined by the variational equation 
Sj(l + 2y)dl 0 = 0 (45) 

At this point we note that in an ordinary Euclidean medium, 
isotropic but not homogeneous, of refractive index n(y x > y 2 , 
the geometric path of a ray, by Fermat's principle, is charac- 
terized by the variational formula 

Sfadl 0 = 0; 
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comparing this with (45) we see that in our field of force, with its 
given by (33), light is propagated as if the space were Euclidean 
and filled with a medium of refractive index 

fi = 1 + 2y. 

This remark becomes even more expressive if we refer once 
more to the trajectories of a dynamical problem. In fact, as 
we have already had occasion to show in § 1J, pp. 323-325, 
the principle of least action leads to the result that the curves 
(45), or, what comes to the same thing (multiplying by c 2 and 
remembering the meaning of y), those for which 

8/c 2 (l + 2y)dl 0 = 8f((?+2U)dl 0 = 0, . (45') 

can be considered as the trajectories of a material particle in 

c 2 

ordinary space in a field of potential £c 2 (l + 4y) = — f- 2U 

2 

and with total energy zero, or, if we prefer, in a field of potential 

c 2 

2U and with total energy . 

Z 

It is interesting to observe that even in the classical mechanics 
the mere hypothesis of the materialization of energy leads us to 
predict a curved path for rays in a gravitational field. If in fact 
we admit that light rays ? regarded as lines of flux of energy, are 
effectively trajectories of material particles, then each of these 
rays— their mutual reactions being supposed negligible— ought 
to behave like a free material particle moving under the action 
of the force in the field (of potential V) with a velocity which 
tends to c at an infinite distance from the attracting masses 
(i.e. for U = 0), or, which comes to the same thing, with total 
energy \c 2 per unit mass. It will be seen that general relativity 
implies, to a first approximation, solely the substitution of 2U 
for U. Now apply these considerations to the path of the rays 
in the sun's gravitational field. In accordance with the above 
remarks, these rays are to be considered the trajectories in the 
problem of the motion of a point attracted by a fixed centre of 
force, the potential, with the same notation as before, being 

2U = 2/m 0 
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and the total energy 

These trajectories are obviously conies with a focus at the 
centre of force. The species will depend on the sign of the constant 
E\ in our case E > 0, so that the curves are hyperbolas. Since 
the divergence from a rectilinear path must be very small, it is 
self-evident that these hyperbolas will be only very slightly 
curved; this can also be proved analytically from the differential 

equations. To show this, let n and ~ denote the direction of the 

P 

principal normal and the curvature at any point of a ray. 
Equating the centripetal acceleration to the centripetal force 
per unit mass, we have 

p dn 

The derivative represents the force in the field in the 
an 

direction n, and cannot therefore be greater than the intensity 

= U i of this force. Further, the integral of vis viva 
r 2 r 

shows that if we neglect terms of the second order, v may be 
taken as equal to c. Consequently we have 

1 < 1 2/m 0 
p c 2 r 2 

If r 0 is the sun's radius, the maximum possible value for the 
force in the space traversed by the light rays is evidently 

that given by r = r 0 ; the above inequality can therefore be 
written 

1 < 2/m 0 
P ^o 2 r/ 

As *^ m ° is the value Z7 & . of the potential at the surface of the 
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sun, and the value of the ratio ~J is 2 X 10" 6 (cf. formula (44)), 
we get finally 6 

- <4 X 10-*I. 
P *o 

In other words, if the radius of curva- 
ture p is not infinite as for a straight 
line, it is at any rate of the order of a 
million times the sun's radius. 

It is therefore perfectly legitimate to 
assume that the rays are in any case 
only very slightly bent, even if they 
pass very close to the sun; in every case, 
therefore, the hyperbola in question will 
have its asymptotes OA\ OT (cf. fig. 4) 
almost in one straight line. 

* Consider in further detail the hyper- 
bolic ray which grazes the solar sphere 
at V. Let O be the centre of the hyper- 
bola, S the centre of the sun and there- 
fore the focus of the given branch of the 
hyperbola. V will be its vertex, and, fir. 4 

if a denotes the transverse semi-axis 
and e the eccentricity, we shall have by definition 

OV = a, OS = ae, SV = r 0 = a(e — 1). 

We know also from analytical geometry that if 8 represents 
the exterior angle between the two asymptotes 

8 1 

sin = . 
2 e 

In the case we are considering, 8 must be very small; hence, 
from this formula, e is very large. Our results will be quite suffi- 
ciently accurate if we take the sine of the angle 8 as equal to the 

arc, and consider - negligible in comparison with unity. Thus 
e 

we can write 001 o 

S = ? = 2 1 1 




e 
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Using the relation r 0 = a(e — 1), we get finally as the 
measure of 8 in terms of the two lengths r 0 and a 

S=f- (46) 

In the classical theory, the transverse semi-axis a in the 
hyperbolic motion due to the Newtonian attraction of a mass 
M is connected with the constant E of the vis viva by the relation 

E =m. 

2a 

Putting for E its value |c 2 , and noting that in our case 
M = 2m 0 , this gives a, and (46) becomes 

•-^ '«-) 

and therefore, using the numerical value already found for this 
expression, 

8 = 8 X 10-°. 

The right-hand side is a pure number, which gives the angle 
8 in radians. In seconds 

8 = 1-7" (47) 

It will at once be seen that this angle S gives the measure of 
the deflection, i.e. the maximum angular deviation to which a 
stellar ray can be subjected by the sun's gravitational action. 
Suppose in fact that we are considering a ray of light which starts 
from a star A and arrives at a terrestrial observer after describing 
an arc of a hyperbola which grazes the solar sphere at V, as in 
fig. 4. The direction of the hyperbola at T, along which the 
observer receives the light ray, is indistinguishable from that of 
the asymptote OT'; the direction in which the light left the star 
is that of the tangent at A, which in its turn is indistinguishable 
from the other asymptote A'0> so that the deflection is the 
exterior angle between A' O and OT\ i.e. 8. 

The direction A'Q will naturally be identified with the 
direction in which T sees the star in normal conditions, i.e. when 
the sun leaves the earth-star direction and the corresponding 
gravitational perturbation becomes imperceptible, so that the 
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visual ray again becomes rectilinear (or so nearly rectilinear that 
the difference is absolutely imperceptible). 

It may be well to point out that if the visual ray from a star 
does not graze the solar sphere but passes at a distance r > r 0 
from the centre of the sun, the deflection diminishes, being in 
inverse ratio to the perihelion distance r. This can be seen as 
follows. The expression (46) for 8 naturally holds for any star 
whatever which is visible from the earth, provided r 0 is replaced 
by the perihelion distance r. We shall thus have 

g 2a 2a r 0 

r r 0 V 

2a 

The factor — has been calculated above, so that we have 
finally r o 

8 = 1-7" X r °. 

r 

Since r 0 corresponds to an angle of 16', it will be obvious that 
if the angular distance from the centre of the sun is even a few 
degrees 8 will not be more than some hundredths of a second, 
and will therefore be totally imperceptible, just as if the ray 
were rigorously rectilinear. 

The angular displacements, if any, due to the sun become 
capable of observation during a total eclipse. A first attempt in 
this direction was made by the Lick Observatory in 1918, but 
the precision of the observations was insufficient for the purpose. 

For the total eclipse of 29th May, 1919, two simultaneous 
expeditions were organized by the Royal Society of London: 
one for Sobral in the north of Brazil, the other for the island of 
Principe in the Gulf of Guinea, both localities being within the 
zone of totality of the eclipse. The results of the observations 
made by these two expeditions can be summarized as follows. 
For the deflection of light the mean value of the displacements 
observed at Sobral gave 1-98", with probable error + 0-12"; 
at Principe the mean value was 1-61", with probable error 0*30". 
The deflection 1*76" predicted by Einstein's general relativity 
lies between these two. This provided a new and striking con- 
firmation of Einstein's theory, as the observed results were 
definitely incompatible both with the zero deviation of geome- 
trical optics, and with the deviation of half this value (0-88") 
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which would be given by the ordinary theory combined with the 
simple postulate that mass and energy are proportional. 

On the occasion of the next total eclipse (21st September, 
1922), visible in Western Australia, three further expeditions 
started for the zone of totality; the American one, organized by 
the Lick Observatory and conducted by Campbell, was the only 
one to secure any useful observations. But the available stars 
were rather far from the limb of the sun, and the deflection was 
therefore small; the results 1 show a wide dispersion, so that 
many astronomers do not regard their mean value as a further 
confirmation of the theory, although it is in almost perfect 
agreement with the Einsteinian prediction. 

11. Three-dimensional metrics with spherical symmetry. 

We shall begin by defining what is meant by saying that a 
metric manifold V 3 has spherical symmetry round one of its 
points 0. We shall follow the geometrical method suggested by 
Palatini 2 , considering along with the F 3 an ordinary Euclidean 
space V'z in one-to-one correspondence with it. This corre- 
spondence being established, any point-transformation (T) of 
F3 into itself (in particular, a rigid motion of V' A ) gives rise 
to an analogous point-transformation of F 3 into itself. There is, 
however, no a priori reason that a rigid motion of F 3 should 
correspond to a rigid motion of F 3 , a rigid motion of a manifold 
being taken to mean any transformation which Jeaves dl 2 un- 
changed, and therefore, in particular, changes geodesies into 
geodesies. 

We shall now say that a metric manifold F 3 has spherical 
symmetry round one .of its points O when each of the oo 3 rigid 
rotations of V' s round the corresponding point 0' determines a 
rigid motion in F 3 . 

Some important properties of the metric of a F 3 with this 
property follow easily from the definition, subject naturally to 
the obvious condition that the metric (i.e. the coefficients of dl 2 ) 
is regular in the region round every point, except possibly the 
point O. It can at once be shown that to any ray f drawn from 

1 Published in the Lick Observatory Jiuttetln, No. 840, 1923. 

2 Of. " Lo spostamento del perielio di Mercuric, ecc," in Nuovo CimerUo K XIV 
(1917), pp. 12-54. 
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<y there corresponds in V z a geodesic j drawn from O. Thus, let 
P' be any point on f which is not O', P the corresponding point 
(which is therefore not O) in F 3 . Let g be the geodesic in V 3 
which is tangential to j at P; from the qualitative hypotheses of 
the case it follows that g exists and is unique. We have to show 
that g coincides with j m 

Consider in the 00 1 rotations which have f for axis: 
these correspond to 00 1 rigid motions in the space V 3 which 
leave fixed all the points of j, and only these. If we suppose that 
g is distinct from j 9 the effect of the 00 1 rotations round f would 
be that g would occupy a simple infinity of positions, retaining 
in each the properties of being geodesic and tangential to j at 
P; we should therefore have an infinite number of geodesies 
drawn through P in the same direction, which is impossible; 
hence g must coincide with j. 

An obvious deduction is that to any spherical surface 2' 
with centre O' there corresponds in V 3 a geodesic sphere 2 with 
centre O. 

Now consider any pair 2, 2' of these surfaces, and the corre- 
spondence between the points Q of one and Q' of the other 
determined by the correspondence between the two spaces. We 
wish to show that the correspondence between Q and is 
conformal. 

Let da' be a generic line element in 2' drawn from Q' y da the 
homologous element drawn from Q. If we suppose the Euclidean 
space referred to polar co-ordinates r, 0, <f> 9 we shall have 

da'* = r 2 (d0 2 + sin 2 0 d<f>*) 

where r = 0'Q\ Further, when r„ 0, <f> are known, they deter- 
mine Q\ and therefore also Q, from the one-to-one correspondence; 
8 and <f> can therefore also be regarded as curvilinear co-ordinates 
of Q on S, and the line element da, corresponding to da' (i.e. to 
arbitrary differentials dd and d<f>), will in every case be repre- 
sented by a quadratic form which we propose to find. 

Consider two elementary arcs da' of equal length, drawn 
from Q' in two different directions. The two homologous arcs 
da will also be equal. For the two arcs da', being equal in length, 
can be obtained from one another, by a rotation round 0' Q'; 
hence we infer that the two arcs da can also be obtained from one 
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another by a rigid motion in F 3 , and are therefore of equal length 
with respect to the metric of F 3 . 

It follows that the ratio ^ is the same for the two directions 

do' 

considered, or, in other terms, that this ratio is the same whatever 
the differentials dd and d<j> may be. It is therefore a function 
H of position alone, i.e. (a priori) of r, 6, <f>; but it will at once be 
seen that this function must be the same whatever may be the 
point Q' of 2' considered, since we can always pass from one 
Q 9 to another by a rotation. We can therefore put 

do* - H 2 do' 2 , 

where H denotes a function of r only. 

For what follows it is perhaps advantageous to replace the 
co-ordinate r (the radius vector in F a ) by a function R(r) defined 
by the equation 

R(r) = H(r)r (48) 

The square of the line element of the geodesic sphere 2 thus 
takes the form 

da 2 = R 2 (d6 2 + &m 2 0d<f> 2 ); . . . (49) 

this gives us the geometrical significance of R, no longer in the 
auxiliary Euclidean metric, but directly in F 3 . In fact, the 
expression (49) for da 2 is that for a sphere of radius R in ordinary 
space, and as such (cf. § 7, p. 240) has Gaussian curvature 
1 

K — — ; this curvature, from its intrinsic nature, belongs to 
R 2 

any surface whose line element is given by (49), and therefore, in 
particular, to our surface 

We can therefore attach the following significance to the 

co-ordinate R: ^ represents at any point the Gaussian curvature 
R 2 

of the geodesic sphere with its centre at the centre of symmetry 
0 and passing through the point. From the property of symmetry 
it follows at once that all the geodesies drawn from 0 cut the 
sphere 2 orthogonally; hence if we denote by dg the elementary 
arc of one of these geodesies, the dl 2 of F 3 can be represented 
in the form 

dl 2 = dg^+do 2 ; 
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and since dg depends solely on R (also from symmetry) we can 
put 

dg = A(R)dR 9 

where A is a function of R, a priori undetermined, so that we 
get in consequence, with the help of (49), 

dl 2 = A 2 dR 2 -J- R*(dd 2 + sin 2 0 d<f> 2 ). . . (49') 

This is the most general expression for the dl 2 of a F 3 which is 
symmetrical round a point. 1 

It is not without interest to show that every V 3 of this kind 
can be conformally represented in Euclidean space. It will be 
sufficient to show that we can determine two functions H{r) 
and r(R) such that we have identically 

A*dFP + R 2 {d6 2 + sin 2 d <fy 2 ) ---- H 2 {dr 2 + r 2 (<Z0 2 + sin 2 0rfc£ 2 )}; 

the necessary and sufficient conditions for this are 

Hr = R, Hdr = AdR 9 
and therefore, eliminating H, 

— == A - - (50) 

r R 

When A is a known function of R, this determines r, except 
for a constant multiplier, which from the strictly geometrical 
point of view remains arbitrary. The modulus // of the conformal 
transformation is then defined by 

H = - (51) 

T 

We shall now calculate Ricci's symbols a Uc (Chapter VII, 
p. 199) relative to a metric of this kind. We again make use oi 
the property of symmetry, noting that an obvious consequence 
of the considerations set out in § 12, pp. 201-208 is that if the 
quadric which determines the local distribution of curvature has 
an axis of symmetry, this axis gives one of the three principal 
directions, while the other two are indeterminate (i.e. may be 

1 This formula had been given as early as 1896. from analytical conside rations 
based on the theory of groups*. Of. Atti delta M. Acc. dei Lmcei, Vol. V (second 
half-year, 1896), pp. 164-171. 
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any pair of directions orthogonal to each other and to the axis 
of symmetry). In our case, a point P distinct from 0 having been 
fixed arbitrarily, and the behaviour of every metric property 
being symmetrical round the geodesic g which joins 0 and P, 
it follows that the quadric of curvatures at P is necessarily 
symmetrical round the direction of g. Hence at every point 
our co-ordinates r, 6, <f> give principal directions of curvature, 
from which it follows at once that in the quadric of curvatures, 
and therefore in the tensor a ik , the product terms are missing, i.e. 

aa — 0 for i =f= k. 

In addition, if a} x is the principal curvature corresponding 
to g, the other two curvatures o> 2 , o> 3 are equal to one another; 
we shall denote their common value by cd. 

We may now recall formula (47) on p. 207, viz. 

'A 

which gives explicitly all the a's as functions of the curvatures 
and of the moments of the principal lines. Since these coincide 
with the co-ordinate lines, along which vary R alone, 6 alone, 
and <f> alone, respectively, they will have for parameters 

*!-^f-y *? = °> «-<>; 

X\ ~~ 0, A a = 0, A? = --^ = — . — - i 
3 1 dl R aind 

and therefore the moments will be 





A, 


^1|2 


= o, 




0; 


Ajjj! = ~ 


0, 


Ao|2 




A 2[3 --- 


0; 


Agj! = 


0, 


^3|2 


= o, 


1 3 r " 


R sin0. 



Substituting in the formula quoted above, and putting 
o> 2 — <*>3 — °>9 we get 

a ii — A^co^ a 22 = R 2 a>, a 83 = a 22 sin 2 0, (52) 
auc ^ 0 (i 4= *) (53) 
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The equations (53) have already been obtained from the 
consideration that in our case the principal lines of curvature 
coincide with the co-ordinate lines, 

We shall now calculate explicitly the value of o> at a generic 
point P from its definition as the Riemannian curvature. From 
symmetry, it can be considered as belonging to any geodesic 
surface whatever with pole P and containing the direction R. 
We shall show that the surface <f> — constant is a particular case 
of such a surface. Take the differential equations of the geodesies 
in our V 3 (of line element dl), not, however, in the form given 
in (47) on p. 134, where they are solved for the variables, which 
would require the calculation of Christoffers symbols, but 
in Lagrange's parametric form, starting from the Lagrangian 
function (the vis viva) 

In the case we are considering 

T — 1{A*&+ ^(0* + sin 8 6 ft) }, 

(where a dot over a letter denotes differentiation with respect to 
the parameter t), and therefore 

d<f> 

% - »• 

From Lagrange's equation for the juigle <f>, viz. 

d dT _dT _ 
dt 9^ d<f> 

it follows on integrating that one of the equations of the geodesies 
has the form 

J2 2 sin 2 0 $ = constant. 



From this it follows that if a geodesic issuing from P touches 
initially the surface <f> -~ constant (so that <f> = 0 at P), ^ 
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vanishes along the whole geodesic, which therefore belongs to 
the surface <j> — constant passing through P, as we wished to 
prove. To find o>, we have therefore to find the curvature of the 
binary differential form 

A*dR* + R*d0 2 (54) 

which expresses the square of the line element of the surface 
<f> — constant. 

The general expression for this curvature is 

a 

(formula (28), p. 194); as our a is A 2 R 2 , it only remains to cal- 
culate Biemann's symbol of the first kind, (12, 12) by means 
of formulae (3) and (5) of Chapter VII. The explicit expression 
for this wae formed by Gauss, and is given in all treatises on 
the subject. We thus get 

= 1 rf /1\ 1 d /1\ 2 

w Aii dii \a) 2R dR \a) ' 

For the curvature co x we find 

1 R 2 { A 2 S 

An independent calculation of these expressions is given in 
the following section. 

12. Digression on the calculation of curvatures. 

While our specific object is the calculation of a> and o> 1? we 
may here, for the convenience of the reader, show how the 
explicit expression for the curvature of a binary form as a function 
of its coefficients can be obtained without calculating ChristoffePs 
symbols. 1 We shall start from the geometrical property of the 
curvature expressed by formula (29') on p. 195, viz. 




1 Cf. F. Sbrana: Rend, della R. Acc. dei Lhvcei, Vol. XXXIII (second half- 
year, 1924), pp. 236-238. 
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where DT denotes the area of an infinitesimal circuit T con- 
taining P, and € represents the angle of parallelism. In order to 
reduce the calculation to a minimum, we shall calculate e with 
reference to a dl 2 of orthogonal form, of the type 

Edx 2 + Gdx 2 (55) 



If on leaving P the direction X which is being displaced makes 
an angle a with the co-ordinate line x lf its parameters A 1 , A 2 
are plainly given by 

A* ~ co * a , A» = ™ ... (56) 



(cf. §§ 4 and 7, pp. 92, 98). Now consider an infinitesimal dis- 
placement 8P, of contravariant components 8x lf 8x 2 ; we know 
that when X is given a parallel displacement along 8P the incre- 
ments 8A 1 ' of its parameters are given by 

SA' - - i jt ( jl, i) \>8x t (i - 1, 2) (57) 

(formula (23), p. 110). In order to avoid the necessity of calcu- 
lating the coefficients on the right-hand side (Christoffel's symbols), 
we note that the equations 

2 

&i= — X H [jl, i) x f x L 
i 

of the geodesies have on the right-hand side quadratic forms 
whose coefficients are precisely the symbols we need. Further, 
the first of the Lagrangian equations of the geodesies corre- 
sponding to the form (57) (the equation relative to xj is 

d dT _ dT = 
dt dxy dx 1 

where T — \(Exf + Gx 2 ) y 

or, performing the differentiations and solving for x l9 
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where JE V E 2 , G x represent derivatives of E and G with respect 
to x± and x 2 . Comparing this with the first of the equations 
(57), we see that the latter can be written 

But from (56) we get 

SA 1 = - y]9 A 2 8a — A 1 8 log +JE, 

and substituting from this in the preceding equation there 
results 

8a = -], (£ 2 Sa~- G l 8x 2 ). 

The angle of parallelism is obtained by integrating 8a round 
the circuit T. Replacing the line integral by a surface integral 
in the usual way (for the signs, cf. footnote, p. 190, Chapter VII) 
we get 

)v[dx x \2JeGI dx 2 \2VWJ 1 2 

Noting that the field of integration reduces to the infinitesimal 
element 

Dr = >/EGdx 1 dx 2 , 
we can write (neglecting infinitesimals of higher order) 

This gives the required expression for the curvature, viz. 

k = - 1 r~ i 7 — 1 W — ^ -mi 

2 J EG LS^ Vi<?/ WEg/ 1 
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For E — 1 (for which the lines x x are geodesies) we get in 
particular the formula 

# = - -L d *^® 
s/G dx 2 

which is frequently used in the theory of surfaces. 

For the line element given by (54), putting x 1 = R, sr* = 0 9 
so that E = A 2 (R) 9 G = R 2 , the curvature K becomes 

w ~ ~ in k ■ • • (58) 

as stated in the preceding section. 

We now come to the calculation of co l9 the curvature corre- 
sponding to the section normal to the lines R. It is to be noted 
that the spheres R — constant, unlike the surfaces tj> = con- 
stant, are not geodesic surfaces, so that does not coincide 

with the Gaussian curvature § 11) of these spheres. To 

R 2 

calculate it, instead of using the direct definition it will be more 
convenient to use the property that dl 2 can be conformally repre- 
sented in a Euclidean space, with 

dl 2 = H 2 dl 0 2 , 

as we have already seen. 

In § 4, p. 228, we found the explicit form of the relations 
between homologous Riemann's symbols for two line elements 
ds and ds' for which 

ds' 2 = e 2T ds 2 . 

We shall identify ds with our dl, and ds' with the Euclidean dl 0 ; 
we can then apply formulae (18) of p. 231 by making the symbols 
marked with a dash vanish (since they refer to the Euclidean 
dl 0 2 ) and putting 

r = — logff (59) 

The formulre then become 

(ij, hk) = a £h (r Jk — TjT k ) — a ik (r Jh — t,t a ) — a Jh (r i/c — r, r fi ) 

+ a Mc( T th — T 'i T h) + <*jk — «t* a jh) At, 
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where the coefficients, the covariant derivatives, and the para- 
meter A are all taken to refer to 

dl 2 = A 2 dR* + B*(d6* + sin 2 6 d<f> 2 ). 

Multiplying these formulae by a jh a ik and summing with 
respect to the four indices, the left-hand side, by formulae (1) 
and (2) of § 2, gives the linear invariant G relative to our F 3 , 
which after some obvious reductions is thus expressed by 

G = — 4A 2 t — 2Ar, 

3 3 

where At = E^t' = 1 & ac a Vc T i T k (Chapter VIII, p. 231); and 
A 2 t « i ile a ik r ik (Chapter VI, p. 154). 

1 

But for a F 3 the linear invariant G is equal to — 2<%f (cf. 
§ 4), so that the mean curvature £M is in our case given by 

Jil — 2A 2 r + At (00) 

As (the sum of the three curvatures) = a> 3 + 2a>, and oj 
has already been calculated, this formula will give the required 
expression for oj^, it remains to find the values of the quantities 
A 2 t and At on the right, using for this purpose the formulae (50), 
(51), and (59). 

From the general expression 

3 

At £ t -*a/*T t .T*, 
i 

we have for our rfi 2 , and for a function t which depends only on J2, 

A- = 

the dash here also denoting the derivative with respect to R. 
Further, from the general expression (18) on p. 154 we have 

and since now %/a = AR 2 sin#, it follows that 

1 d 



AzT = Air- dR 
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In our case, from (59) and (51), 



t = — logff = log-, 



and by (50) _|logr = < 



hence 
It follows that 
At = 



A — l 



A 2 T r.-: 



U-i) 2 
1 d 

AW- dR 



AR 2 \ A) AR dR \a)' 



using the expression (58') for to, this can also be written m the 
form . . 

Substituting in (60) the expressions just found for At and 
A 2 t and for Jil its value co x + 2co, 2o> cancels out on both sides, 
and we get forojj the value stated at the end of § 11, viz. 



~ i ( ] - i)- 



(61) 



13. The gravitational equations in the case of spherical sym- 
metry. Schwarzschild's rigorous solution. 

We shall now apply the equations of the Einsteinian statics 
to the particular case of a single attracting mass, or more generally 
of a distribution of masses having spherical symmetry round a 
point O. Using the terminology of § 11 we shall deal with matter 
distributed in accordance with any law dependent only on R in 
layers bounded by geodesic spheres of centre O. The Einsteinian 
da 2 will have the statical form 

ds* = V*dx*- dl*, 



where dl 2 will necessarily be of the type (49'), and F, from 
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symmetry, will also depend only on R. We shall agree to consider 
only regions outside the field occupied by the attracting masses. 
In these regions the statical equations (21'), (22) of § 4 for empty 
space will hold, i.e. 

<M = 0, (21') 

«*+ y - 0 (22) 

Since denotes the mean curvature of the symmetrical V 3f 
i.e. tho sum u> x + 2o> of the three principal curvatures, (21'), 
together with (58) and (61) of the preceding section, gives 

1 r R*\ A 2 / R dR \A/ 

whence on separating variables and integrating 

A 2 = — I — , (62) 

1- » 
R 

where a denotes a constant of integration. 

It is to be noted that whatever the constant a may be the 
expression found satisfies the physically necessary condition that 
at an infinite distance from the attracting masses the metric 
tends towards the Euclidean form. In fact, if R — - co 9 A — * 1 , 
so that the dl 2 (49') becomes the ordinary Euclidean expression 
in polar co-ordinates. 

The symbols a lk are then completely defined by the formulas 
(52) and (53), where o> and a> x have the values (58) and (61). 

In order to put the gravitational equations (22) in an explicit 
form, we must again replace the covariant derivatives V \ k by 
the ordinary derivatives. This can also be done without any pre- 
liminary calculations, as follows. Let the sc-'s denote generic co- 
ordinates in a space with a generic metric. Take a function V 
of the x's, and consider its variation along a geodesic line along 
which the x's are considered as functions of a parameter t. We 
shall have in the first place 

_ _ 2,,. V t x. 

Differentiating again, and substituting for the x/s their values 
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as given by the equations of the geodesies, we get, as a particular 
case of the notion of covariant derivatives (Chapter VI, §§ 1 
and 2, p. H4), rf2y 3 

d p — ^ikVik^^k (63) 

Further, assuming in particular x r = 22, x t = 0, x s — ^, 
and remembering that our V is a function of R only, we have 

d Y. = V'R 
dt 

(dashes denoting derivatives with respect to R), and 
d 2 V 

a *= F'72 2 -f V'R. 
dt 2 

But for our metric, i.e. for 

T - ±\A 2 R* + R*(d*+ sin 2 0^ 2 )], 

the equation of the geodesies for the co-ordinate R gives 

d dT _3T ^ 
dt dR dR 

or A 2 R + d( fP- W - I d{ f*} R 2 - R{6 2 + sin* 6 <j> 2 ) = 0, 
dR dR 

whence we get 

R = - A A &+ | 2 (0i + sin* 0 <£ 2 ). 
Using this result the foregoing expression for - becomes 



= ( F " - ~a') m + % (&i + sin2 5 



This expression, like (63), must hold along a generic geodesic, 
i.e. for arbitrary values of the quantities x± = R, x 2 = 0, 
2% = Comparing them we get 



V A' ~RV 
= 0 (i 4- k). 



(64) 
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Substituting in the gravitational equations (22) these values 
for the Ftf/s and the values (52) and (53) for the a^'s, we see 
at once that the equations with two distinct indices reduce to 
identities, those for the pairs of indices 11 and 22 take the form 

V" V A r 
+ v ~ VA = °' * • • • (65) 

R V 

*■« +" VA, = °. 

and the remaining equation is the same as this last one. Sub- 

A' 

stituting in this equation for cd its value (58), i.e. - — , it becomes 

RA Z 

A' V 

%+ y v~° 9 (66) 

or AY — constant. 

At an infinite distance from the attracting masses the 
Einsteinian ds 2 must reduce to the pseudo-Euclidean form, and 
therefore the coefficient V (the Romerian velocity of light) must 
tend to 1 like A; hence the constant must have the value 1, 
and we have 

AY = 1 (06') 

This equation and (62) give A and V in finite terms, so that 
the required ds 2 is now completely determined. The equation 
(65) remains to be considered, but it will at once be seen that 
with the values (62) and (66') it reduces to an identity. In fact, 

substituting for w 1 its value ( 1 — ' \ ' (1 — F 2 ), and 

it \ JL / It 

A' V' 
for by (66), the equivalent — - , multiplying by V 2 , and 
Jl V 

remembering once more that AY = 1, (65) becomes 
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On substituting for V 2 the value — = 1 — a , ' we find 

that this equation is satisfied identically, which proves the 
required result. 

The rigorous form of the Einsteinian ds 2 with spherical sym- 
metry is therefore 



ds 2 = (l - ^ dx 2 -dl 2 . . . . (67) 



with dl 2 = J!fL_ + R 2 (d0 2 + sin 2 0 d<f> 2 ). 

This expression for ds 2 was first given by Schwarzschild. 1 
The metric contains a constant a which is a priori arbitrary; its 
value can be deduced from a consideration of the intensity of the 
field of force at great distances from the attracting masses. In 
these regions the spacelikc dl? tends, as we know, to become 
Euclidean, Ti becoming identical with the length of the 
radius vector drawn from the centre of symmetry; further, the 
expression 

- &v*= 

represents the potential of the field (cf. Chapter XI, p. 328). Com- 

fM 

paring this with the classical Newtonian expression J for the 

potential due to a mass M concentrated at the origin (or sym- 
metrically distributed round it in any way), we see that we must 
put 

« (68> 



where M is the sum of the attracting masses. 

It follows from § 1 1 that every dl 2 with spherical symmetry, 
and therefore in particular the Einsteinian dl 2 (67), can be con- 
formally represented in a Euclidean space, the modulus of the 



1 Sitzungsbcrichte der Prcuss. Akad. der Wins., 1910, pp. 189 196. 
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R 

conformal representation being — , where r is defined by (50), 
i.e. by r 

dr _ AdR 

T~ ~r - 

Using the relations 

*-T '-til 

we have to express the right-hand side of this equation in terms 
of V, which gives 

dr = 2dV 
r 1-72' 

whence on integrating 

1 + V 

r==r °T=V 

where r 0 denotes a constant. 

If we wish to impose the natural condition that r ? like R, 
shall tend to become identical with the ordinary radius vector 
at an indefinitely great distance from the attracting masses, 
we shall have to determine r 0 in such a way that 

lim - = 1. 
r — -00 r 

Since when 22 — - 00, V — ~ 1, this gives 

Consequently H = ~ = j^y^ 

and therefore = HUl<* *= --2L 

(1 + V) 4 

As an instructive example, we shall apply these rigoroua 
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formulae to calculate over again, for a symmetrical field, the 
expression (33) of § 6, viz. 

ds 2 = (1 - ty)dx 2 - {1 + 2y)dl 0 2 

^where y stands for — ^, which to a first approximation gives the 

Einsteinian ds 2 corresponding to an assigned Newtonian field. 
In our case comparison of the coefficients V 2 and 1 — 2y of dx Q 2 
gives rigorously 

so that, from the value (68) of a, U is precisely the expression for 
the Newtonian potential of a mass M symmetrically distributed 
round the centre. Comparison of the coefficients of dl 0 2 imposes 
the condition (at least to a first approximation) 

Prom the expression 1 — 2y for V 2 we have to a first approxi- 
mation V — 1 — y, and therefore 

(1 + V)~* = (2 - y)"« = I (1 + 2y), 

which ensures that the above condition is effectively satisfied so 
long as we neglect terms of higher order. 



14, Spatially uniform metrics; their cosmological interest. 

We shall now examine whether there exist solutions of the 
gravitational equations in statical conditions, and on the hypo- 
thesis that the spacelike dl 2 has a constant curvature K and that 
the energy tensor is also uniform, meaning by this that it is 
of the type (66) of p. 358 (applied to the statical case). This 
is equivalent to assuming for the T&b the expressions 

^00 - V 2 (e-p) = V% . , . . (70) 
T ik = pa tJt (i,k = 1,2,3), . . (71) 
where the a^'s obviously denote the coefficients of dl 2 . The two 
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quantities r\ ( > 0) and p represent respectively (cf . Chapter XI, 
p. 358) the energy density and the pressure (or pull, if p < 0) in 
the medium. 

We next take into account the geometrical hypothesis that 
the spacelike manifold has constant curvature K. When the 
three principal curvatures o> 2 , o> 3 all reduce to K, the canonical 
expressions for Ricci's symbols (formula (47) on p. 207), 
together with (46) on p. 206, give immediately 

a ik = Ka £/C9 (72) 

while by the definition of the mean curvature we have 

<M = 3K (72') 

Using these results, the first of the gravitational equations, 
(18) of § 4, becomes 

3K — k V (73) 

We deduce from this that K > 0, which comes within the 
general observation of § 4 that in statical conditions the mean 
curvatxire is always either positive or zero. The equation (73) 
then shows that 17 is necessarily constant when K is, or in other 
words that the medium must have a uniform distribution of 
energy, or, what is the same thing, of matter. 

On account of this circumstance, this type of solution has a 
particular cosmological interest. Tt is true that the celestial 
bodies are separated by distances which are large compajred with 
their dimensions, and therefore the distribution of matter in 
space is essentially discontinuous; but from a statistical point 
of view it is natural to ask what are, so to speak, the mean 
mechanical conditions of the universe; i.e. what would be the 
nature of the space-time metric on the hypothesis that the whole 
of the cosmic matter, instead of being concentrated in discrete 
masses, is uniformly distributed throughout all space, with the 

mean density \ of the actual distribution. 

c 2 

It is important to note that, as we are dealing with a space 
of constant positive curvature, its extension S (in the sense of 
Chapter VI, p. 160) is,finite, as we shall show in a moment. As- 
sociating it in the meanwhile with the foregoing cosmological 
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consideration, we reach the conclusion that in this type of solution 
the total quantity M of matter is finite, and is given by 

M = l S (74) 

In order to find the extension 5, we take dl 2 in the canonical 
form (31) on p. 240, viz. 



dl 2 ^ 1 
u 2 u 2 



~ ° - Ad yi 2 +dy^+dy 2 ) y . (75) 



K 3 
where u == 1 + r 2 and r 2 = S^ 2 . . . (76) 

4 1 

We have in the first place, for the element of volume corre- 
sponding to the Euclidean dl 0 2 referred to polar co-ordinates 

dS {) = r 2 drmi0d6d<f> 9 
and therefore, for the corresponding element of physical space, 

as - dS ° 

The total volume is in consequence given by 

the integral being extended to the whole of space. The integration 
with respect to 6 and <f> gives in, so that we can write 

Jo ^ 3 

Here we can introduce the radius a of the sphere of Gaussian 

1 r 
curvature K, putting K ~ --, and substitute x — for r 

a 2 la 

as the variable of integration. This gives 

S = 327m 3 /*' X —%~ =Wa\ 
J 0 (1 + x 2 ) 8 
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and therefore, from (74), 

M = 2n*a a1 t. 

Cr 

In the given conditions, physical space has thus the volume 
2n 2 a 3 9 and is therefore finite, though at the same time unlimited. 
This latter property holds, as for ordinary two-dimensional 
spherical surfaces, for any manifold of constant positive curvature 
in any number of dimensions. 

Another general property which calls for mention is that in 
a variety of the kind specified the geodesies are all closed lines, 
of length 2ira. Consider specifically the case of three dimensions 
which corresponds to the physical space of the problem under 
discussion. It will be seen immediately that without loss of 

dl 2 

generality we can always refer dl 2 — |- to polar co-ordinates 

r, 0, j> in such a way that for a geodesic assigned in any manner 
tf> = 0 at one point of it; from the Lagrangian equation relative 
to the parameter <f> it then follows, as in § 1J, that <f> = 0 all 
along the curve, which is therefore a geodesic of one of the sur- 
faces <f> — constant, or in particular, by suitable choice of the 
<f> - axis, of <f> = 0. In view of the transformation formulae 
between Cartesian and polar co-ordinates, 



this is equivalent to saying that any geodesic can always be 
considered as belonging to the co-ordinate plane y 2 — 0; but, 
for y 2 = 0, dl 2 assumes the canonical form of a two-dimensional 
manifold of constant curvature K, i.e. of the ordinary sphere of 
radius a. The geodesic therefore coincides with a great circle 
on this sphere, and is therefore a closed curve of length 2rra. 

We now pass on to the other six gravitational equations. 

Taking account of (71) and (72), the equations (17) become 



which can be satisfied in two different ways, according as we 



y x — r Bind cos<£, 

y 2 rrr. T SU10 Sm<£, 

y s - - r cos0, 
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suppose V constant (Einstein's cylindrical space-time) or V a 
function of position (De Sitter's hyperspherical space-time), 1 

15. Einstein's solution. 

First, suppose V constant. In this case it is necessary and 
sufficient to add to (73) the condition 

K + k P ^ 0 (78) 

From this it follows first of all that the normal stress p is 

necessarily the same at every point, and on comparison with (73) 

thero follows . 

P = - H (73') 

whence we get the following result: 

In a homogeneous medium subjected to a uniform pull of 
\rj, 7] being the energy density, the space assumes the constant 

positive curvature K — - 77, the velocity V of light remaining 

3 

constant (and being naturally supposed not zero). 

Remembering that in statical conditions the potential of the 
force in the field is — (Chapter XI, p. 328), we see at once 
that in the present case the force is zero. 

1G. De Sitter's solution. 

Now suppose that V is a function of position. Multiplying 
(77) by a ik and summing with respect to i and k we get in the 
first place 

^ + -0 

or ^- = UK+Kp). 



The equations (77) are therefore equivalent to 

" + K* a a = 0 . . . . (77') 



Vile 



V 

where for brevity we have put 

K* = K— i(3K + k P ). . . . (79) 

1 Of. T- Lbvi-Civita : "Realta fisica di alcimi spazi normali del Bianchi," in 
Rend, delta It. Ace. dei Lincei t Vol. XXVI (first half-year, 1917), pf>. 519-581. 
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It is easy to see that the equations (77') are mutually con- 
sistent for V not constant (in fact, they constitute a complete 
system with respect to V considered as the unknown function) 
if, and only if, K* = K. To prove this, take the commutation 
formula (20) on p. 186 for the second covariant derivatives of 
a simple system V h which gives 

F„-F,«« hk}V r . 
i 

Substituting for Riemann's symbols of the second kind the 
expressions for a manifold of constant curvature K (formula 
(19') on p. 234), viz. 

we get V lhk — V Lkh — K (a a . V h — a ih F, f ). 

Further, multiplying (77') by V and taking the covariant 
derivative, we get 

V m = -#»a*y A ; .... (77") 

substituting in the preceding equation, we get the conditions of 
integrability 

(K* - ~ K) (a, k V h — a ih V k ) - • 0 

for every set of values of the three indices i, h, k. Since by hypo- 
thesis V is an effective function, one at least of its derivatives 
(say V k ) will not be zero. Jn the above equations take this value 
of k and a value of h different from h\ multiply by a lh and sum 
with respect to i. This gives 

(K*-K)V k - 0 

whence K * — K = 0, 

Q. E. D. 

Using this result, we get from (79) 

3A r + Kp = 0, 

which leads to the same qualitative statements with regard 
to the stresses as those made above for the cylindrical space- 
time. 
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For the integration of the equations (77'), in which from now 
onwards we put if* — JSC, we must again take dl 2 in the canonical 
form (75). 

The covariant derivatives V ik of V with respect to our dl 2 
can be found explicitly as functions of the ordinary derivatives, 
without direct calculation, from the considerations in Chapter 
VIII, pp. 222-232. In fact, considering our dl 2 and the corre- 
sponding Euclidean dl v 2 referred to the same co-ordinates, we 
have, from formula (9) on p. 224, 

v, k -v? k = -hplv» 

1 

where, by (16) on p. 230, 

pl = K^")r Birg— a hk r\ 

with in our case e~ ' lr ------ u 2 9 u having the value given in (7(5). 

Noting that for dl Q 2 referred to Cartesian co-ordinates the 
derivatives V% are identical with the ordinary second derivatives, 
and that r l = r h a ik - Sj", we have the required expressions 
in the form 

V lh - ■ \- 1 (u t V t + u k V k ) - 5 k u t V t . 

dy t dy k u 11 1 

Substitute these expressions in the equations (77'), which on 
multiplying by uV take the form 

u 

for K* = K and cu = -'. Putting for brevity 

u 

W ---= uV (80) 

and using the expression (76) for «, we get 

« a~~- + «* V > + u : V k 57" — o 8 ' ; 



whence it follows that 



S* I - — — + S, «| K, 1. 

\2 W M 1 / 
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But by (80) 

Vi = — u t ; 

u u 

substituting, and taking into account the definition (76) of u and 
the consequent identity 

K+Xtu? = Ku, 
i 

the foregoing equations become 



From this it follows at once that for i 4= k the second deri- 
vatives of W vanish, so that W must be a function with the 
variables separated (the sum of three functions, one of y 1 alone, 
one of y 2 alone, and one of y 3 alone). Further, for i — the 
equations above show that as the terms on the right are the same 
for all three cases, we must have also 

d*W d*W d 2 W 

all equal; their common value must therefore be a constant, 

K 

which we can denote by 6 0 c . Hence the most general expression 
for W is of the type 

W - b ° K f* + w+ C, 
4 

where w is a linear homogeneous function, a priori undetermined, 
and C is a constant. The coefficients of this expression are to 
be so determined that 



i.e. that i t y t W t — W = 6 0 u. 

1 



By Euler's theorem on homogeneous functions the linear 
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term w contributes nothing to the left-hand side, so that its three 
coefficients are arbitrary; there thus remains 

6 « A ' r 2__ 0 ---= b 0 u, 
4 0 

which by (76) reduces to 

c - - v 

Hence the final expression for W can be written in the 
form 

W - b 0 (u -~ 2) + w, .... (81) 

the constant b 0 and the coefficients of w being still completely 
arbitrary. This number of constants could of course be predicted 
from the fact that the system (77') is completely integrable; as 
all the second derivatives of the function V are defined by it, 
it is obviously equivalent (cf. Chapter II, p. 13) to a total differen- 
tial system in four unknown functions, viz. V itself and its 
three first derivatives. 

It is also to be noted that the three constants of integration 
which appear in the linear expression 

w = 2 (b x y 1 + b 2 y 2 + & 3 y 3 ) 

can obviously be reduced to one, since by a suitable orthogonal 
transformation applied to the jy'a (for which r 2 , u, and <2Z 0 2 are all 
invariant), we can always reduce the trinomial to the form 2by^ 
with b = +Jb 2 rV+~6 3 2 . 

But we may also suppose b — 0; this can be formally proved 
(though in a less elementary way) by taking account of the 
homogeneity of a space with constant curvature, which enables 
us to take a point fixed in advance as the point y t — 0, while 

dl 2 

retaining the canonical form 0 for dl 2 . 1 

u 2 

1 This becomes intuitive for the case of two diriien.si.oii8, in which a manifold 
of constant positive curvature is an ordinary sphere, and the canonical expression 
for dl 2 is obtained by stereographic projection of the sphere on a diametral plane 
(cf. Chapter VIII, p. 241). The assertion in the text reduces in this case to the 
obvious geometrical fact that any point whatever of the sphere may be chosen as* 
the centre of projection. 

(D665) IS 
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Using these results, it follows from (80) that the expression 
for the spatially uniform ds 2 , on the hypothesis of V variable, is 

where W — b 0 (u — 2), \ 

1 ,K ' (82) 

u — 1 + r 2 

4 l 

It is assumed that the constant b 0 is not zero, as otherwise 
we should have identically V ~ 0, which is not permissible, 
since we are considering the case of V variable. 

In view of the physical significance of F, those points, if any, 
at which V — 0 obviously denote singularities in the field; they 
remain, so to speak, optically isolated, in a sense which will be 
explained further on. On the other hand, as r, and therefore u, 
increases indefinitely, V tends to 6 0 ; further, for finite values of r, 
u remains essentially finite* and > 1, so that the singular points 
are determined by the equation W — 0. This equation, com- 
bined with (82) and the relation K — \, becomes 

a 2 

r 2a. 

which in the representative Euclidean space defines a sphere 
D 0 . The surface 1) which corresponds to it in the physical space, 
and which, by § 11, its also a (geodesic) sphere, is called the horizon, 
because it constitutes in a certain sense the limit of the perceptible 
universe. This follows from the fact that light, and a fortiori 
a material particle, would take an infinite time to reach it. To 
prove this, let A and B be two generic points; then by the 
definition of V the time taken by light to pass from A to B is 



J V J uV I 



W 



where the integral is taken along the ray joining^ to B. When 
B tends to the horizon the integrand tends to an infinity of the 
first order at B> and therefore the integral cannot remain finite. 

As we have already several times recalled (in particular in the 
preceding section), the force in the field is the gradient of — %V 2 ; 
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in consequence it tends to displace the material masses towards 
the regions of minimum V 2 , i.e. towards the horizon. This 
circumstance was regarded as an incongruence of De Sitter's 
space-time; but it is to be observed that it must be taken to refer 
solely to accidental masses (sufficiently small not to modify the 
field perceptibly), and not to those uniformly diffused masses 
which constitute it, the equilibrium of which is automatically 
assured by the gravitational equations. 

It is interesting to remark that the problem of spatially 
uniform metrics (§14) admits of a solution which includes both 
Einstein's and De Sitter's solutions as particular cases. 1 

In fact, in the argument beginning at equation (77 '), it was 
tacitly assumed, at (77"), that K* is constant. If we drop 
this supposition we find 

V illk - V m = K* (a lk V h - a iu V k ) + V ( aik K* h - a lh I? k ), 

which, on ( combination as before with 

Vihh — V lk/l = V h — a ih V k )> 

gives 

(K - K*) («,, V„ - a ih V k ) - V{a lk Kl - a lh K' k ). 

If we put E for A' — /i*, this becomes 

E{a lk V h - a ih V k ) + V(a tk E k - a ih E k ) - 0, 

leading, by the same treatment as in the former case, to 

EV h + V E h - 0, 
or EV = constant 

= A, say. 

Equation (77') may now be written 

( r -ff)„ + M r -*)- 0 - 

Thus, if instead of (80) we write 



1 This extension of the analysis was suggested to me by Dr. John Dougall. 
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the investigation proceeds exactly as before, and leads to the 
same value of W, viz. that given in equation (81). The new 
value of V is therefore 

v = A + y 

K u 

= A . b 0 (u - 2) 
u 

If A = 0 we have De Sitter's solution; if b 0 — 0 we have 
Einstein's. If both A and b 0 are different from zero, the 
curvature K is still constant, but the (normal) stress p is 
variable, being given by 

K — K* E 
A 

- y 

or l(3K + K P ) .- 

We shall conclude this section by showing that lie Sitter's 
space-time not only, like Einstein's, implies that physical space 
(i.e. any manifold x 0 — constant) has constant positive curvature 
K, but has itself, as a four -dimensional manifold, constant 
negative curvature. 

To prove this, we start from a known property of every space- 
like dl 2 which has constant curvature A", namely (Chapter VIII, 
p. 234), that Riemann's symbols for dl 2 have the form (19') of 
p. 234, or 

{ir, hk) - • K(a ih S;; - a it 8 r h ) (t, r, h, h = 1, 2 ; 3). 

By (11) and (13) of § 4, these relations can be written in the 
form 

{ if, hk j ' ■ - - K(g, k 8; - g, k h' h ), . . (83) 

still for the same values 1, 2, 3 of the indices. Now it is easy to 
see that these last formulae, in virtue of the expressions (14') 
for Riemann's symbols for our ds 2 and of the equations 

Y* =_- - Ka ik - K 9tk (i, k = 1, 2, 3), (77"') 
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will still hold when 0 is included among the values to be assigned 
to the indices. This is obvious when one, three, or four indices 
are equal to 0, since then (§ 4) both sides of the equation vanish. 
In the case of two indices zero we have, as in § 4, to examine the 

two types {Or, OA}', [i0, Ok}'. The corresponding values of the 

s y 
left-hand side are respectively V(V r ) k — V S £ a ri and 
ie. in view of (77 "'), 1 V 

- K V* E, a" a ik — K V*8 r k — — - Kg* 8J, Kg ik . 
i 

The values of the expression on tlie right are clearly the same. 
Thus the equations (83) hold for all values of the indices from 
0 to 3, which is precisely equivalent (still by formula (19') of 
p. 234) to saying that the ds 2 of space-time has constant negative 
curvature — K. 

It may be well to observe that while the notion of a manifold 
of constant curvature and the measure K of this curvature are 
by their nature invariant, i.e. independent of the choice of the 
co-ordinates of reference, this invariance does not persist for 
multiplication of ds 2 by a constant factor m. In fact, when all 
the coefficients g tfr are multiplied by m, Riemann's symbols of 
the second kind are unchanged, so that, again by formula (19') 
of p. 234, the curvature K is divided by m. In particular, for 
m — — 1, it changes sign. This explains the apparent contra- 
diction between our enunciation and that of some writers who 
take — ds 2 as the fundamental form and assign constant positive 
curvature to Dc Sitter's space-time. 

17. Einstein's additional term. Indication of other rigorous 
solutions. 

For Einstein's solution we found in §§ 14 and 15 (formulae 
(73) and (78)) 

3K = k V , K + Kp = 0. 

We cannot therefore suppose the matter devoid of stresses 
(p — 0) without concluding that -q — 0, which brings us back 
to the uninteresting case of a totally empty space. Now if 
we take the cosmologico-statistical point of view (in the sense 
indicated in § 14), it seems reasonable to suppose that there 
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must be a solution of the gravitational equations corresponding 
to the hypothesis of a uniform distribution of matter which shall 
be so tenuous that the molecular actions between contiguous 
particles, and therefore the stresses, are imperceptible; such, 
that is, that p — 0, while 77 is a constant other than zero. 

Since the gravitational equations in the original form (8), viz. 

\Og ik - -kT i1c , 

have no solution of this type, Einstein was led to modify them 
(very slightly) by adding a term which maintains the tensorial 
character of the equations (8), and which in ordinary cases is 
completely imperceptible while serving to render possible a solu- 
tion of the type indicated. This term was assumed by Einstein 
in the particularly simple form Xg lk , A denoting a constant which 
in most cases is negligible compared with G. The gravitational 
equations so modified are 

(i.fc - 0,1,2,3). J " " ( } 

The statical equations accordingly become 

M A - K7), 

** + ^ - + *) a, : - kT & (i, k - 1,2, 3). 

Proceeding as in §§ 14, 15, on the hypothesis that the space- 
like dp has constant curvature, that the density is constant, and 
that the stresses are isotropic (i.e. are given by (70) and (71)), 
we ultimately reach the two equations 

3/f — ky]-{- A, 
K + Kp 1=- A, 

between K, 77, p, and A, which take the place of (73) and (78). 

Here it plainly becomes possible to put p = 0 without 77 
necessarily having to vanish at the same time; we need only 
take 

2K 

K 
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To get an idea of the order of smallness of the constant A, 

we may note that the mean cosmic density ~ of matter can 

c 2 

certainly be regarded as considerably less than that of the nebuleo. 
which is of the order of 10" 17 gm./cm. 8 . It is therefore legiti- 
mate to assume that in any case 

A - K<: kc2 10- 17 . 

2 

From the numerical values (in C.G.S. units) k ~ 2.X 10" 48 , 
c = 3 X 10™ we have 

A £<9x 10' 4f> . 



For the radius a of the universe 




we thus get a lower 



limit given by in22 

* a > 10 22 cm. 

This radius is therefore, certainly considerably greater than 
JO 17 km. or 10,000 light-years. 

We shall conclude with some bibliographical references con- 
cerning the rigorous solutions of the gravitational equations (with 
or without the cosmological term) in some special cases. 

Schwarzschild's solution is supplemented or generalized in 
various important respects by the original contributions of 
Birkhoff, De Donder, Eddington, v. Laue, 1 and Weyl, which 
are given in their respective treatises, and of Signorina Longo 2 , 
Trefftz, 3 Nuyens, 4 and Vanderlinden. 5 

A different type of solution is considered in the researches of 
Weyl, 6 Levi-Civita, 7 Bach, 8 Chazy, 9 Palatini, 10 and Kasner. 11 

1 Of. also Sitzungabcrichte dcr Preux&. AH: der Wins., 19213, pp. 27-31. 

2 Nuovo Cimento, Vol. XV, 1918, pp. 191-211. 
8 Math. Annalcn, Vol. 86, 1922, pp. 31 7-326. 

4 Comptes Jlrndus, Vol. 176, 1928, pp. 1376-1379. 
fi Bull, de VAc. mi/ale de Belr/inuc, 1921, pp. 260-276. 

6 Anna/en drr Phytdk, 54 (1918), pp. 117 145; 59 (1919), pp. 185-1S8. 

7 4 * ds 3 einaUiiniani in campi newtoniani Noten I -IX, in Itrnd. dilla /?. Arc 
dei Lined, Vols. XXVI, XXVII, XXVIII, 1917-1919. 

8 Mathrmatinche Zeitschrift, Vol. 13, 1922, pp. 134-145. 

9 Bulletin de la SocMe Math, de France, Vol. LII, 1924, pp. 17 37. 

10 Nuovo Cimento, Vol. XXVI, 1923, pp. 5-24- 

11 Trans, of the American Math. Society, Vol. 27, 1925, pp. 101-105, 155-162. 
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P. 122, line 9 from foot. See a short but substantial article by 
E. Cartan, who discusses the question exhaustively from the 
geometrical point of view: Annates de la Societe Polonaise de Mather 
matujues, Vol. VI (1927), pp. 1-7. An earlier paper by M. Janet, 
ibidem, Vol. V (1926), pp. 38-73, may also be consulted. 

P. 168, at the end. A luminous demonstration of M. Fermi's 
theorem, as simple as it is intimately related to fundamental prin- 
ciples, has been given recently by Mile. P. Nalli: Rend. Acc. Lincei, 
Vol. VH (1928), pp. 195-198. 

P. 171, at the end. The use of local] v geodesic co-ordinates enables 
us to recognize at once an important property of the e-systems, which 
they possess in common with the fundamental tensors a,^ a i!r 
— their covariant derivative vanishes identically. For each element of 

an e-system is either zero or of the form *Ja, i —r . The deri- 
ve 

vatives of the « t/ /s being zero (in geodesic co-ordinates, for the 
point considered), the same is true for every element of an e-tensor. 
It follows (p. 71, final paragraph) that the covariant derivative 
vanishes in any system of co-ordinates whatever. 

P. 188, line 3. The general case in which the cycle T and conse- 
quently the area V are not restricted to be infinitely small, can 
also be treated without great difficulty, as has been shown very 
ingeniously by J. M. McConnkll, Rend. Ace. Lincei, Vol. VII (1928), 
pp. 208 213, 306-309. 

P. 209, end of footnote. See also J. L. Synoe, On the Geometry of 
Dynamics, Phil, Trans. Roy. Soc, A, 226 (1926). pp. 31-106; and 
various notes by MM. Berwald, Boggio, Cartan, Crfdjsli, 
De Mira Fernanpes, Onickscu, Vrak(*eaki t , Rend. Arc. Lincei, 
Vols. V, VI and VII (1927, 1928). 
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ADDITIONAL NOTES 



P. 228, after formula (14). Formula* (13) and (14) can be proved 
more readily, without any formal development, by means of geo- 
desic co-ordinates, as has been remarked by Mile. Nalli. See her 
note Due dimostrazioni nel calcoh assoluto, Boll dell' Unione Mat. 
Italiana, Vol. VII (1928), pp. 124, 127. 

P. 234, line 10 from foot. A simpler proof, due to Mile. Nalli, 
is given in the paper cited in the note to p. 228. 

P. 439, at end of references. On all these questions, Darmois, 
Les equations de h gravitation einsteinimne, Fasc. XX, Memorial 
des Sciences MatWmatiques (Paris, Gauthier-Villars, 1927) may 
also be consulted. 
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Action, in mechanics, 331. 
Action, stationary, 324, 331. 
Addition of tensors, 75. 
Angle between two directions, 92. 

in V m 123-126. 

Angular metric, 121. 
Antisymmetncal systems, 66. 
Area, clement of, 99. 
Associated tensors, 95. 
with respect to quadric, 96. 

— vectors, 140. 
Aton\ vibration of, 400. 
Attracting mass, spherical, 419-423. 
Attraction on planet, 369. 
Autoparallelism of geodesu, 104, 140. 

Bending of light lays, 402, 403-408. 
Bianchi's derived vectors, 137-140. 
Bibliography. See Preface. 

— of solutions of Schwarzschild type, 
439- 

Bilinear covariant, 18, 20, 21. 

— form, 66, 70. 

Binary form, curvature of, 414-419. 

Canonical form of quadratic, 206, 281. 

— system as to given congruence, 278. 
Cartesian co-ordinates, 61. 

— co-ordinates locally, 164, 167, 171, 
202. 

Central forces, 397. 
Change of co-ordinates, 61. 

— variables, 2, 61. 

general, 80. 

Characteristic of envelope, 101. 

matrix, 9, 39, 87, 250. 

surfaces, 47. 

ChristofTel's symbols, derivatives of 
art in terms of, in. 

determinant a in terms of, 112. 

of first kind, 109, in. 

of second kind, 110, in. 

vanishing of, 121. 

Classical mechanics, correction to, 392. 



( t meaning of constant, 335. 
Coefficient #, )0 , second approximation to, 
392-394- 

Coefficients of ds 2 9 by experiment, 363- 
368. 

- qualitative properties of, 369. 

quadratic form, covariance of, 73. 

Commutation of second derivatives, 

273. 
— rule, 184. 

Complete system of partial differential 

equations, 52, 53. 
— total differential equations, 1 5- 

18. 

Composition of tensors, 79. 

velocities, in relativity, 317. 

Compounded tensor, 79. 
Conformal representation, 229, 246. 

— — of Einsteinian in Euclidean space, 
4^3- 

of K a , 411. 

Congruence, canonical sxstem as to, 278. 

— geodesic, 262, 274. 
— curvature of, 275. 
normal, 263, 275, 285. 
of curves, 46, 47. 
solenoidal geodesic, 363. 

Congruences in Euclidean space, 282. 

— set of normal, 277. 

— - sets of, 265. 
Constant, gravitational, 386. 

— h\ in Einstein's equations, 387. 
Continuity, equation of, 347, 349-35 1 ! 

3fa>, 363. 

molecular, 361. 

Continuous system, mechanics of, 347, 

352- 

— , — in covariant equations, 348, 349. 

— — in terms of encrgv tensor, 351. 

— — relativistic equations for, 359. 
Contraction of bodies in motion, 313. 

— of tensors, 77-79. 
Contravariance of the a tk \ 92. 
Contravariance, m-fold system, 69-71. 
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Contravariancc, simple system, 67, 81. 

— transformation by, 67, 69—71. 
Contravariant differentiation, 149. 
Co-ordinate hypers urf aces, angle be- 
tween, 228. 

— lines, angle between, 128. 

parameters of, 98. 

moments of, 98. 

Co-ordinates, Cartesian, 61. 

— curvilinear, 61, 87. 

— in space-time, 290. 

— locally Cartesian, 164-171, 202. 
geodesic, 164, 167, 171. 

— Flicker's, 68. 

Cosmological interest, solutions of, 426. 

— term, Einstein's, 438, 439. 
Covariance, m-fold system, 69—71. 

— simple system, 67, 68, 82. 

— transformation by, 64, 67-71. 
Covariant, bilinear, 18, 20, 21. 

— derivative, 146. 
second, 184. 

— differentiation, 144, 149. 

of invariant, 147. 

of vector, 147. 

— simple system, typical, 82. 

systems, sets of, 74. 

Cramer's rule, 54, 55. 

Curl of vector, 161. 
Curvature, calculated, 413. 

— constant, canonical forms for mani- 
folds of, 238, 240. 

manifolds of, 236, 246. 

— Gaussian, 172. 

- — — of ordinary surface, 193. 

— geodesic, IS.S-^- 
of congruence, 275. 

— hyperspherical, 258. 
formula for, 260. 

— lines of, 286. 

— locally constant, 235. 

- — mean, of physical space, 382. 

of V nt 258. 

of K a> 372. 

— of a K a , 201. 

calculated, 413. 

formula for, 203. 

— of space, in Einstein's space-time, 
429, 438. 

— principal, in K 3 , 204. 

- — Riemannian, of V n , 195. 

- space of constant, 425-439. 
Curvatures, calculation of, 414—419. 
Curvilinear co-ordinates, 61, 87. 
Cylindrical space-time, Einstein's, 429. 

Density, mean cosmic, of matter, 426, 
439- 

— of energy, 350, 351. 
momentum, 350. 



Density of nebulas, 439. 
Derivative, covariant, 146. 

— of vector, 139, 140. 
Derivatives, transformation of, 85. 

— of determinant a, 112. 

De Sitter's space-time, 429-435. 

— - constant negative curvature of, 

436. 

Determinant, functional, 2, 4-12. 

— <2, derivatives of, in terms of Chris- 
tofters symbols, 112. 

— reciprocal elements in, 54, 55, 80, 81. 
Developable, circumscribed, 10 1. 

— surfaces, 100, 101. 

Differential equations, linear partial, 
33-6i. 

normal form of, 36. 

— total, 13-33. 

— parameter, first, 231. 

— parameter, second, 154, 393. 

— parameters , in special case, 418. 
-- total, 13, 64. 

Differentials, linear transformation of, 
80. 

Differentiation, covariant, 144, 149. 
Direction. See Versor. 

— spacehke, 356, 357- 

— timelike, 356, 357. 

Directions of co-ordinate lines, in V, /t 

surfaces, in V nt 127. 

— specification of, 90. 

Discrete system, motion of, 360-363. 

incompressible, 361. 

Discriminant ofay's, 157. 
Displacement, cyclic, 173, 186. 

— infinitesimal, 104. 

— parallel, 103. See Parallel displace- 
ment. 

— spectral, 400. 

</.T a and gravitation, 375. 

— - approximately pseudo - Euclidean, 

— coefficients of, and gravitational 
experiments, 367. 

— by experiment, 363-368. 

— covariance of coefficients of, 90. 

— discriminant of coefficients of, 90. 

— expression for, 88. 

for an Einsteinian space-time, 392. 
■ - — assigned Newtonian field, 388- 
392. 

single point mass, 419—423. 

— generalization of, 320. 

— indefinite, 141. 

— in mechanics, 293. 

in variance of, 294. 

•- invariance of, 308, 311. 

— qualitative properties of coefficients 
of, 369. 
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ds 2 same for dynamics and light, 336, 
337. 

— statical, 326, 327, 371, 377, 378,392. 

Einstein tensor in, 380, 381. 

Riemann's symbols in, 379. 

— ten functions involved in, 322. 

— vanishing, for light, 332, 338. 

variational principle for, 340. 

Divergence of double tensor, 154. 

gradient of vector, 154. 

stress tensor, 344. 

vector, 153. 

Dual variables, 68, Hi. 

Eclipse observations, 407, 408. 
Einstein's cylindrical space-time, 429. 

— form of Hamilton's principle, 291. 

— gravitational equations, 376. 

— tensor, 200, 371. 

divergence of, 371. 

for a V 2t 371. 

— in statical ds 2 , 380, 381. 

— — linear invariant of, 371. 
Einstemian, and Newtonian, trajec- 
tories, 395. 

— motion, law of time in, 396. 

— space-time, ds 2 for an, 392. 
Electrodynamics of bodies in motion, 

3". 312. 
Element of area, 99. 
Elongation, relative, 305. 
Empty space, 392. 

ds* for, 3«2, 383. 

Enrrgy and mass, 294, 298. 
- matter, 298. 

- — metric of F 4 , 328. 

- density, 350, 351, 356, 382. 
field of uniform, 426. 

— flux of, 350, 351, 356, 357. 358. 
intrinsic, of matter, 297, 298. 

— kinetic and potential, 296, 324. 
tensor, 355, 358. 

— and curvature, 374. 

— elect romagnetism, 374. 

— — equations of motion, 359. 

— - local phenomena, 374. 

metric of space-tune, 383. 

physical interpretation of, 359. 

vanishing, 382, 392. 

Envelope of family of planes, 101. 
Equation of orbit, 397. 

Equations, Einstein's gravitational, 376. 

— of motion, Einstemian modification 
of, 3Si. 

in terms of stress tensor, 351. 

-of free particle, 287. 

— relativistic, for continuous system, 
359- 

— total differential, 13-33. 
— integrals of, 47. 



Equipollence of vectors, 103. 
Equivalence, mechanical, a theorem of* 

394- 
e-systems, 158. 
Ether, 335. 

Euclidean manifold, 121, 

and Riemann's symbols, 242-246- 

— - — Christoffel's symbols in, 121. 

— metric manifold, condition for, 242- 
246. 

Experiment and coefficients of ds 2 t 363- 
368. 

Experiments, gravitational, and ds 2 9 367. , 

— optical, and ds 2 , 363. 
Extension of a field, 160. 

curved space, 426, 427, 428. 

Facet, 201. 

Fermat's principle, 335, 402. 

and geodesies, 341-343. 

in relativity, 340, 341. 

Focal directions in congruences, 283. 
Force absorbed in stress system, 349> 
374, 375. 

— disturbing, in planet's motion, 397- 

— in relativity field, 328. 

— inversely as cube of distance, 397 • 
Form, bilinear, 66. 

— invariant, 73. 

— linear, 67. 

— multilinear, 66, 83. 

— quadratic, 66. 
Forms of class 1, 253. 

conditions for, 257. 

Frame of reference, 335. 

Frequency of spectral line modified, 

400. 

Fresnel's convection coefficient, 319. 
I — - formula for velocity of light m 
moving media, 318-320. 
Function, alternate, 35. 

implicit, 3. 
— uniform, 14. 

Functional matrix, characteristic of, 

9, 39, 87, 250. 
Functions of position, 80, 83. 

Galilean systems, 349. 

force, stress, and divergence in, 

349- 

Gauss, on intrinsic geometry, 99. 
Gaussian curvature, 172. 
of V z% 193. 

General relativity, concept of, 294. 

postulates of, 364. 

Generalization of metric of V A , 320. 

Lagrangian function, 322-324. 

Geodesic, co-ordinates locally, 164, 167, 
171. 

— curvature, I35~ 1 37- 
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Geodesic, definition of, 103, 128. 

— deviation, 208-220. 

■ Jacobi's formula for, 219. 

— excess, 197. 

— manifold, 162. 

— ■ motion of particle in, 326. 

— principle, 337. 

— — Einstein's, 328, 331. 
in V 4 , 341. 

— sphere, as horizon, 434. 

— surface, 164. 

— triangle, 197. 

Geodesies and trajectories, 324, 326, 

— autoparallelism of, 104, 140. 

— differential equations of, 131-135. 

— in rigid motions, 408. 

— in space of constant curvature, 428. 

— Lagrange's equations for, 208, 331, 

332, 34*. 3^7, 413. 

— near gift-en geodesic, 208. 

— of zero length, 33<>-334f 337- 
Geometrical optics, 334. 

— - - according to Einstein, 335-338. 
Gzie (Einstein tensor), in statical ds~, 

380, 381. 

linear invariant G of, 380, 

381. 

G, linear invariant of Einstein tensor, 
38o, 381. 

Gravitation, modification of Newton's 
law of, 397. 

— not absorbed in energy tensor, 375. 

— with point mass, 419-423. 
Gravitational constant, 386. 

— equations, and the facts, 387. 
Einstein's, 376. 

— - — for spherical symmetry, 419. 

— - for statical 381. 

in space of constant curvature, 

428. 

modified by cosmological term, 

438. 

. rigorous solutions of, 437. 

solution of, 4 1 9-42-1. 

— experiments and ds 2 , 367. 

-- field and spectral lines, 400-402. 
path of light in, 403-408. 

— forces, as privileged, 374. 

— tensor, 371, 372. 
divergence of, 372. 

Hamilton's principle, 287. 

■ Einstein's form of, 291. 

modified, 294-298, 301, 322-324, 

35*- 

Horizon, in De Sitter's space* time, 434. 
Ilyperspherical representation, 258. 
Hvpersurface, 121. 

- — hyperspherical representation r<f, 258, 



Hypersurfaces in Euclidean space, 249, 

253. 

— parallel, 251. 

Immersion of V n in Euclidean space, 
121. 

Indefinite ds 2 , 141. 

Independence of functions, 5, 8-10. 

Inertia, index of, 299. 

— principle of, in relativity, 298. 
Inner multiplication of tensors, 79. 
Integral, general, 40, 42, 43, 45. 5©- 

— independent, 40, 42. 

— of differential equations, 36, 37. 

— principal, 38, 39- 49- 
Intrinsic geometry of surface, 99. 
Invariancc and Hamilton's principle, 

in relativity, 322. 

— of ds*, 308, 311. 

— ni-fold system, 69. 
■ — simple system, 67. 

— transformation by, 62. 
Invariant, dematives of, 83. 

- quadratic form, 73, 84. 
Isotropic manifolds, 232. 

Jacobi on geodesies, 208. 
Jacobian systems of equations, 52, 53. 
Jacobians, 2. See Determinant, func- 
tional. 

Kinematics, Galilean, 318. 

- of rigid systems, 301. 
modified, 303. 

- relativity, 311, 316. 
Kummer on congruences, 286. 

Lagrange and geodesies, 208, 331, 332, 

341, 3*>7> 413. 
Lagrangian binomials, 289. 

— equations, 289, 331, 332, 341, 367, 
413- 

— parameters, 288. 
Laplace's operator, 394. 

Law of gravitation, modifications of, 397. 
Light, constancy of velocity of, 335. 

— in gravitational field, frequency of, 
400-402. 

— path of, as trajectory, 403. 

-— in gravitational field, 403-408. 

— • propagation of, reversible, 365. 

- rays and trajectories, 34^. 

— - signals, 364. 

Local time, 290, 311, 312. 
Lorentz transformation, 300, 308, 310, 
316. 

invariance for, 352, 353, 354- 

— — most general, 313. 

— translation, 316. 
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Manifold, i. 

— Euclidean, 121. 

— geodesic, 162. 

— metric, 119. 

— M-dimensional, 119. 
— "sections of, 162. 
Manifolds, isotropic, 232. 

— of constant curvature, 236, 238, 240, 
246. 

their mutual applicability, 249. 

Mass and energy, 294, 298. 

— and metric of V tl 328. 

— and velocity, 295. 
Matrices, functional, 8-12. 
Matter, mean cosmic density of, 439. 

— total quantity of, 427, 428. 
Maximum and minimum, 128. 
Maxwell's theory, 383. 

Mayer's method of integration, 25. 
Mechanical equivalence, a theorem of, 
394- 

Mechanics, classical, correction to, 291- 
294, 320, 392. 

— generalized, 320-324. 

— of continuous systems. 347, 352. 

- — in covariant equations, 348, 349. 

— - — with any co-ordinates, 347. 
Metric, angular, 123. 

— of space-time and energy tensor, 383. 

— of V A and physical phenomena, 374. 
— . — generalization of, 320. 

-- pseudo-Euclidean, 209, 360. 
Metrical elements of figure, 100. 
Metrics, different, covariant derivatives 
for, 222. 

— — for same V n , 220. 

Riemann's symbols foi, 224. 

— in conformal representation, 229. 

— relativity, qualities of, 325. 
statical, 326. 

stationary, 326. 

— spatially uniform, 425. 

— with spherical symmetry, 408-414 
Michelson-Morley experiment, 335. 
Minimum time, principle of, 341. See 

Fermat's Principle. 
Mixed system, or tensor, 70, 71. 

— systems of total differential equations, 
29-33- 

Molecular action, system with no, 360- 
363. 

Moments of co-ordinate lines, 98. 

direction, 92, 120. 

covariance of, 92, 120. 

— relation connecting, 92, 120 

Momentum, 295. 

Morera's method of integration, 22-25. 
Motion, Einsteinian, of planets, 396. 
Multilinear form, 66, 69, 83. 
Multiplication of tensors, 76, 



Nebula?, density of, 439. 
Newtonian equations, 287, 377. 

— field, assigned, space-time for, 388- 
39*- 

— motion, differences from Einsteinian, 
377- 

— potential, 375. 

— potential and ds 2 f 336, 369. 
Normal congruence, 263, 275, 277, 285. 

— form of differential equations, 36, 

Operator A, properties of, 176. 

— linear, 33-37, 48, 84. 
Optics, geometrical, 334. 
Orbit, equation of, 397. 
Orthogonal directions, seta of, 205. 

Parallel, ambiental, 171. 

— displacement, 103. 

— — along a geodesic, 103, 104. 
angles unchanged by, 103, 114. 

— cyclic, 173, 186. 

— 0 f vector, 192. 

Peres's formula for, 193. 

Parallelism, 102. 

— and curvature, 193-108. 

— and infinitesimal displacement, 104.. 

— angle of, 198. 

-- differential definition of, 105. 

— equations of, 1 10-112. 

— extension of notion of, 137. 

- intrinsic character of, 106. 

- intrinsic equations of, 107. 
invanance of, no. 

— symbolic equation of, 107. 

— with respect to surface, 102. 
Parallelogram rule for vectors, 117. 
Parallels, kinematical construction of. 

102, 104. 

Parameter of family of surfaces, 45. 

— first differential, 231, 418. 

— second differential, 154, 393, 418. 
Parameters and moments, relation of, 

92, T2$. 

— ]>agrangian, 288. 

— of co-ordinate lines, 98. 

— of direction, 91, 120. 

■ - - contra variance of, 91, 120. 

relation connecting, 91, 120. 

Parametric equations of surface, 86. 
Path of light, in gravitational field, 402, 

403-408. 
Pejes's formula, 193. 
Perihelion, displacement of, 396, 398. 
formula for, 398. 

of Mercury, 399. 

of other planets, 400. 

Permutability (dd - 5d), 116. 
Perturbations, Newtonian, 399. 
Pfafrian, 13, 20, 26, 161, 174 
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Pfaffian as invariant, 81, 82. 

— systems, 14. 

Physical phenomena and metric of V 4 , 
374. 

Planet, motion of, 369. 

Planets, Einsteinian motion of, 396. 

— motion of, discrepancies in, 322. 
Poisson's equation and Einstein's theory, 

386, 387. 

— equation for potential, 375, 377. 

— parentheses, 35, 36. 
Postulates of general relativity, 364. 
Potential, Newtonian, 287, 292, 297, 

322, 323, 369. 375, 377, 388, 394, 396, 
400, 403. 
and ds 2 , 336, 369. 

— — and metric of space, 391. 
Potentials, 10 gravitational, 375. 
Principe expedition, 407. 
Product Of tensors, 76. 
Pseudo-Euclidean ds*, 325, 376. 
- — every metric locally, 360. 

— metric, 299, 360, 383. 

— - — and versors, 329. 

Quadratic, canonical form of, 205, 281. 

— differential form, invariant, 84. 

— form, 66. 

covariance of coefficients of, 73. 

definite, 90. 

— - — ds 2 , character of, 120. See ds 2 , 

— — invariant, 73. 

with non-vanishing discriminant, 

90. 

— forms of class zero, 242. 
of class i, 253. 

. Euclidean, 242. 

Riemann's symbols for, 242-246. 

theory of, 205. 

Radioactivity, 297. 
Radius of universe, 439. 
Reciprocal elements in determinants, 54, 
55, 80, 81. 

— tensors, 95. 

Refracting medium, space as, 402. 
Refraction of light, 334. 
Refractive index, 334. 
Relative motion, 313, 316. 
Relativity and Newtonian theory, differ- 
ences, 377. 

— composition of velocities in, 317. 

— general, and Poisson's equation, 386, 
387. 

postulates of, 364. 

— invariance in, 322. 

— kinematics of, 311, 3 * 6. 
metrics, qualities of, 325. 

■ statical, 326. 

— — stationary, 326. 



Relativity, postulates of, 364. 

— principle of, 311. 

— restricted, 300. 

— special theory of, 300. 
Reversible motion, 327. 

— transformation, 3, 7, 61. 
Reversibility of light propagation, 365. 
Ricci's coefficients of rotation, 268. 

— lemma, 148, 152. 

— symbols, 199, 372, 389, 411, 426. 

— tensor, 199. 

linear invariant of, 200, 380. 

Riemann- Chris tofTel tensor in V 4t 372. 

— 2Q components of, 372. See 

Riemann 1 s symbols. 
Riemannian curvature of V n , 195-198. 
Riemann's symbols, 172. 

and conformal representation, 228, 

246. 

and Euclidean metric manifold. 

242-246. 

— - - - Bianchi's identities in, 182. 

of first kind, 176, 179-182. 

of second kind, 175, 177, 178. 

Rigid motion in any manifold, 408. 
Romerian units, 307. 

Rotation, Ricci's coefficients of, 268. 
Rotor of vector, 161. 

Saturation (of indices). See (Contraction. 
Scalar product of vectors, 98, 126, 152. 
Scbur's theorem, 235. 
Schwarzschild's solution o* gravitational 
equations, 419-423. 

extensions of, 439. 

Second covariant derivatives, 184. 

— differential parameter, 154, 393. 

— fundamental form of V n> 252. 
Section of manifold, 163. 

— of V a , 201. 

Sets of orthogonal directions, 205. 

— of simple systems, 74, 156. 

— reciprocal, 74. 
Severi's theorem, 171. 
Shift, spectral, 400. 

Signals, light, and coefficients of ds 1 , 
364-366. 

Simultaneity, 290, 311. 

Sirius, spectrum <tf Companion of, 402. 

Sobral expedition, 407. 
j Solution of differential equations, 36, 
I 48. 

— — gravitational equations, Schwarzs- 
child's, 410-423. 

first approximation deduced 

from, 425. 

Solutions, rigorous, of gravitational 
equations, 437. 

Space, metric of, and Newtonian poten- 
tial, 391. 
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Space, non-Euclidean, 391. 

— of constant curvature, extension of, 
426, 427, 428. 

— - — gravitational equations in, 428. 
Space-time, 290. 

— an Einsteinian, ds* for, 392. 

— co-ordinate transformations, 290. 

— De Sitter's, 420-435. 

— De Sitter's, constant negative curva- 
ture of, 436. 

— Einstein's and De Sitter's, case in- 
cluding, 435. 

— Einstein's, curvature of space in, 438. 

— Einstein's cylindrical, 429. 

— metric, and energy tensor, 374. 

— with assigned Newtonian field, 392. 
Spatially uniform metrics, 425. 
Spectral displacement, 400. 

Sphere, geodesic, in F a , 409, 410. 
Spherical symmetry and gravitational 
equations , 419, 

metrics with, 408-414. 

Statical ds\ 326, 327, 37i, 377, 378, y)2. 

— field, 400. 

— metrics, 326, 327. 
Stationary merries, 326, 327. 

Fermat's principle for, 340. 

Stre.* 344. 

— and bilineai form, 345. 

— force absorbed in, 349. 

— in spatially symmetrical metrics, 425, 
429, 430, 436. 437, 438- 

— kinetic, 351, 356, 35 8 - 

— normal, 345. 

— tensor and equations of motion, 
35 1- 

divergence of, 344, 346, 354. 

in classical theoiy, 344. 

in generalized co-ordinates, 346. 

interpretation of divergence, 346. 

Sum of tensors, 70. 
Surface, geodesic, 164. 

— - intrinsic geometry of, 99. 

— — parametric equations of, 86. 
- vectors, 96. 

Surfaces, developable, 100,101. 
Symmetrical double systems, 72. 

— systems (tensors), 65. 
Symmetry, spherical, and gravitational 

equations, 41 y 

metrics with, 408-414. 

System, mixed, 70, 71. 

Systems (tensors), antisymmetrical, 73. 

— double, 65. 

— m-fold, 65. 

— of order m, 65. 

— of order zero, 65. 

— symmetrical, 65, 

Tensor, 70, 71. 



Tensor, Einstein's, 200, 371. See Ein» 
stein's tensor. 

— energy, 355. 

and equations of motion, 359. 

— first general definition of, 80. 

— gravitational, 371. 

10 components of, 372. 

— Riemannian, 371. See Riemann't 
symbols. 

— second general definition of, 83. 

— stress, divergence of, 344, 346, 354. 
with vanishing elements, 71. 

Tensors, addition of, 75. 

— associated, 95, 96. 

— composition of, 79. 

— contraction of, 77—79. 

— inner multiplication of, 70. 

— multiplication of, 76. 

— reciprocal, 95. 

Third fundamental form of V n , 259. 
Time, conventional, 364. 

— local, 290, 311, 312. 
Totai differential, 13. 
equations, 13-33. 

equations, complete system of> 

i5-r8. 
Trajectories, 403. 

— and geodesies, 324, 326. 

— and light rays, 343. 

— - Einsteinian and Newtonian, 395. 

— in generalized mechanics, 324. 

— orthogonal, 263. 
Transformation, affine, 304, 305. 

— by contravariance, 67, 09-71. 

— by covariance, 64, 67-71. 

— by invariance, 62. 

— formula.* of, 80. 

— nomographic, 304 

— - linear, of differentials, 80. 

— of derivatives, 85. 
Transformations, linear, 67. 

— Lorentz, 300, 308, 310, 316. 

— - reversible, 3, 7, 61. 

— - snace-time co-ordinate, 290. 
Translation, motion of, 305. 

Universe, radius of, 439. 

Variations, Point are's equation of, 208. 

Variety. See Manifold. 

Vector, contr^variant and covariant 

components of, 97. 
in V nt 120, 127. 

— derivative of, 139, 140. 

— determination of, by invariants, 266. 

— product, 159. 

— product of versors, 201. 

— projection of, in V, it 127. 

— transformation of, 62, 63, 64. 
Vectors, equipollence of, 103. 
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Vectors, parallel and equal, i03-„ 
— - scalar product of, 98, 1264 
— - surface^ or tangential, 96. 

— Aero, ypi 

Velocities absolute and relative,, 316- 
3iu. 

— composition of, 306. 

r~ according to Einstein, 317, 

Velocity, earth's orbital, 399. 

— laTge universal constant, 291, 292, 

— mass and, 295. ¥ 

— of light, 292, 306, 3". 334, 335, 339. 
382, 399- 

— — irreversible, 340. 

law of variation of, 339. 

non-ay mmetrical, 340, 



Vcrsor (unit vector, direction), 92, 96, 
98, 102, 103, 123, 125, 126, 140. 

— in f^ 4 , and corresponding vector, 
329* y 

Versors, and pseudo-Euclidean metrics, 
329. 

— spacelike, 330. 

— timelike, 330. 
Vibration of atom, 400. 
Volume of curved space, 427, 428* 

World lines, 290, 329, 35a, 353. 

of light, 337, 364. 

parameters of, 353. 

and stress tensor, 353. 

Zero vectors, 97. 



